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RATIONAL APPROXIMATIONS TO IRRATIONAL COMPLEX 
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BY 


LESTER R. FORD 


We shall call a number m + ni, where i = V— 1, a complex integer if m 
and n are real integers. By a rational complex fraction we shall mean a frac- 
tion whose numerator and denominator are complex integers. By an irra- 
tional complex number we shall mean a number not expressible as a rational 
complex fraction. 

Hermitet has proposed a method of forming a suite of rational complex 
fractions approaching an irrational complex number w. The method is 
briefly the following. Consider the positive definite hermitian form 


(1) F(x,y) = (4 — wy) (& — 09) + yg, 


where Z, 7, w are the conjugate imaginaries of x, y, w, and where k is real. 
Let x = p, y = q be a pair of complex integers giving a minimum value of 
the form F (the values x = 0, y = 0 being excluded). Then the fraction p/q 
is an approximation to w satisfying the inequality 


(2) 


al = 


If k be made to decrease from @ to 0, the integers p and q change from time 
to time, and there results an infinite suite of fractions tending toward w. 
If w be a rational complex fraction, a suite is likewise determined, but it 
terminates with the fraction w after a finite number of terms. 

It is the object of the first part of this paper to investigate the suite of 
Hermite, to give the fractions a geometrical interpretation, to find a method 
of calculating them, and to study their properties.t 

The second part of the paper is devoted to approximations by means of 

* Part II presented to the Society, Sept. 4, 1916, under the title Regular continued fractions. 

t Sur la théorie deg formes quadratiques, Journal fir Mathematik, vol. 47 
(1854), particularly pp. 357-366. 

t The principal results of this part have been presented in a brief note in Comptes 
Rendus, vol. 162 (1916), pp. 459-461. 
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continued fractions. It is shown that the convergents of any continued 
fraction of the form 


(3) 


where €, = +1 or +7, and where s, is a complex integer (such a fraction 
will be called regular), can be derived by a method which is a generalization 
of the method of Hermite. The properties of the convergents of such a con- 
tinued fraction, in particular their values as approximations to the sum of . 
the fraction, are investigated, and theorems relative to the periodicity of 
the continued fraction are established. 

I take occasion here to express my indebtedness to Professor Humbert of 
the Collége de France, who suggested to me a study of the suite of Hermite, 
and who pointed the way to its geometrical interpretation in his investigation 
of the analogous problem for the case of reals.* 


PART I 


THE FRACTIONS OF HERMITE 


1. The-Group of Picard. The whole theory of rational approximations to 
complex numbers is concerned with the group of transformations 


,_ +B 
yz + 6 


where a, 8, y, 6 are complex integers. This group is known as the Group of 
Picard. The group is not properly discontinuous in the z-plane, but if it be 
defined as a group of space transformations according to the idea of Poincarét 
a fundamental polyhedron for the group exists. The space transformations, 
where ¢ is a variable measured along an axis through the origin perpendicular 
to the z-plane, are 


(4) z (ad — By = 1) 


_ paa + 2a8 + 2a8 + BB 


yy +26 + + 
_ pay + zad + + 
+ + 276 + 65’ 
_ pray + 278 + + Bd 
+ + 276 + 66’ 
where p” = (” +2/2 and p? = & + 22.8 Combining these equations we 


*Comptes Rendus, vol. 161 (1915), pp. 717-721; Journal de mathé- 
matiques, 7th ser., vol. 2 (1916), pp. 79-103. 

t Mémoire sur les groupes kleinéens, Acta Mathematica, vol.3 (1884), pp. 49-92. 

§ Poincaré, loc. cit., p. 54. 
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get also 


(6) 


vy + + 276 + 


For points of the z-plane the transformation (5) is equivalent to (4). 

The transformation inverse to (4) is 
(7) 
hence the inverse space transformation can be written down by replacing 
a,a,6,d5by —6, —5, — a, — @ respectively in (5) and (6). 

These space transformations preserve angles both in magnitude and sign, 
transform spheres into spheres and circles into circles (planes and straight 
lines being included as special cases of spheres and circles), and transform 
figures inverse with respect to a sphere into figures inverse with respect to the 
transformed sphere. These properties are a consequence of the fact that 
each transformation is equivalent to an even number of inversions in spheres. 

The fundamental polyhedron for the Group of Picard has been found by 
Bianchi* to be the pentahedron lying above the sphere + 7° + ° = 1 

z = £ + in) and enclosed by the four planes = = 3,9 = —3,9 
The transforms of this region by the transformations of the group fill up, with- 
out overlapping, the whole space above the z-plane. This division of the 
half-space can also be derived from the fundamental pentahedron by inverting 
it in each of its faces, inverting each new pentahedron in each of its faces, and 
so on ad infinitum. 

In what follows we shall find it convenient to adjoin to the fundamental 
pentahedron that arising from it by reflection in the plane = 0. This 
double pentahedron, which we shall henceforth call the fundamental penta- 
hedron, then lies above the sphere & + 7? + ¢° = 1 and is bounded by the 
planes £ = +3, » = +3. The four plane faces we shall call the lateral 
faces of the pentahedron, and the remaining face we shall call its base. Of the 
five vertices, that at © we shall call the peak, reserving for the other four the 
name vertices. The vertices lie in the plane ¢ = 3 v2. 

If a transformation of the group be applied to this pentahedron there 
results a pentahedron bounded by spheres orthogonal to the z-plane, and 
we shall apply to the faces and vertices of this pentahedron the names of the 
corresponding parts in the fundamental pentahedron. Thus the four lateral 
faces meet in a point, the peak, in the z-plane. The four vertices and the 
base lie in the space above the z-plane. 

The repetitions of the fundamental pentahedron by the translations of 
the group z’ = z + @ fill the whole space outside the spheres of unit radius 


*Mathematische Annalen » vol. 38 (1891), pp. 313-333. 
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with centers at the points of the z-plane whose z-coérdinates are complex 
integers. Each of these pentahedra has its peak at and its vertices in the 
plane ¢ = 

The transformation z’ = — 1/z carries the fundamental pentahedron into 
another with the same base and with its peak at the origin. By the trans- 
formations z’ = z + 6 this pentahedron is carried into others with bases in 
common with those of the preceding paragraph and with peaks at the points 
of the z-plane whose coérdinates are complex integers. 

2. The positive definite hermitian form. ‘The binary form 


(8) (x,y) = + bag + bay + cyg, 


where a and ¢ are real, is called a hermitian form. It is definite if the deter- 
minant, D = bb — ac, is negative. We shall suppose that a > 0; that is, 
that the form is positive. If we set x/y = z and equate (8) to zero, we obtain 


the equation of an imaginary circle in the z-plane, 
(9) azz + bz + b2 =0. 
The one-parameter family of spheres through this circle contains two real 


point spheres; these are, if we write b = b, + tbe, at the points 


b be 
(10) =>, f= 


a a 

The point lying in the upper half space (¢ = +~V¥— D/a) is customarily 

chosen as the representative point of the form.* This point has the property 
that if x and y be replaced by the new variables 

X = ax + By, Y = yx + by (ai — By =m +0) 


where m is real, the representative point of the new form is derived from 
that of the old by the space transformation corresponding to 


7 
aes yz + 5° 
In the form (1) of the Introduction D = — k*. Writing w = w; + tw2 we 


find the representative point of the form to be 


Gis 
or 
(11) f= k, 
where k is the positive square root of k*. As k decreases from © to 0, the 
representative point traces the half of the line z = w lying in the upper half 
space, passing from © to the z-plane. 


* Fricke-Klein, Theorie der automorphen Functionen, vol. 1, p. 455. 
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Minimum of the form. As a step toward finding the minimum of the 
form (8)—that is, the minimum for complex integral values of x and y (x = 0, 
y = 0 excluded)—we shall establish the following proposition. 

If the representative point of the form lies in a pentahedron whose peak is at 
infinity the minimum of the form is the first coefficient. 

The first coefficient is a permissible value, being given by x = 1, y = 0. 
Obviously x = p, y = 0 where |p| > 1 yields a greater value. We shall 
treat two cases: = p, y= 1, and x = p, y = q, where |g| > 1. Thus 
all possible values of x and y are considered. 

The representative point of the form lies outside the unit sphere whose 
center is z = p; that is, outside the sphere 


(2-—p)(2-p) +P =1. 
The coérdinates of the representative point from (10) are 
¢ = Vac —bb/a, 


and these then satisfy the inequality 


(-2- \(-3 ac — bb 
a ?P a + a 
whence 


(12) app + bp + bp +e=a. 


That is, 6(p, 1) 2a, whatever the integer p may be. 
The representative point also lies above the plane ¢ = 


| 
bo 
= 


But 
(p,q) = app + bpg + bpq + 


bq a ac — bb _ 
(13) a p+ a p+ a + a2 949 


aqq _ 


9 = 


since qg = 2. The proposition is thus established. 

Suppose now that the representative point does not lie in a pentahedron 
whose peak is at ©. We observe that the set of values of the form given by 
complex integral values of the variables is unchanged if we make the sub- 
stitution 


(14) wx=aX+BY, y=yX+6Y, By=1, 


a, 8, y, 6 being complex integers. For complex integral values of X and Y 


| 

4 
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yield complex integral values of x and y, and conversely. The new form in 
X and Y will have the same minimum as the original. 


Now et 
(15) 


+6 
be the transformation of the Group of Picard carrying the fundamental 
pentahedron into the pentahedron in which the representative point of the 
form (8) lies; and let the coefficients a, 8, y, 6 of (15) be used in the sub- 
stitution (14). Then the representative point of the new form lies within 
the fundamental pentahedron, and its minimum is the first coefficient. The 
new form we find, on making the substitution (14), to be 
(16) ¢'(X, Y) = (aaa + bay + bay + cyy) XX +---. 
The first coefficient is then (a, y). 

The peak of the pentahedron in which the representative point of the form 
@(2x,y) lies is the transform of Z = ~; that is, z = a/y. We have thus 
proved the first part of the following theorem: 

THEOREM. If a/y (in its lowest terms) is the z-codrdinate of the peak of the 
pentahedron in which the representative point of the form (2x, y) lies, the mini- 
mum of the form is ¢(a, 

Conversely, if 6 (a, y) is the minimum of the form the peak. of the pentahedron 
in which its representative point lies has the z-coérdinate a/y . 

The converse follows at once in case the form attains its minimum for a 
single pair of variables. We see from the derivation of (12) and (13) that 
the minimum of the form is given by a single pair of values in general. Now 
(12) becomes an equality only if the representative point lies in a base common 
to two pentahedra and (13) becomes an equality if the representative point 
lies in a vertex, where several pentahedra meet. In these exceptional cases 
there are two or more pairs of values giving the minimum, but we easily show 
that this arises from the fact that the representative point may be said to 
belong to two or more pentahedra. 

3. Geometrical interpretation of the fractions of Hermite. The representa- 
tive point of the form (1) of the Introduction we found (11) to be z = w, 
¢ =k. The fraction of Hermite for a given value of k is p/q where x = p, 
y = q gives the minimum of the form; that is, it is the z-céordinate of the 
peak of the pentahedron in which the representative point of the form lies. 
As k decreases from « to 0 the representative point traces the half-line z = w 
from © to the z-plane. We have then the following interpretation: 

Let a moving point trace the half of the line z = w lying in the upper half-space, 
passing from « to the z-plane. The z-codrdinates of the peaks of the successive 
pentahedra through which this point passes are the fractions of Hermite tending 
toward the value w. 
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From this interpretation it is obvious that there are only a finite number 
of fractions if w is rational. The moving point eventually enters and remains 
within pentahedra with peaks at w, and the suite terminates. 

4. The S-and >-spheres. In this section we shall deduce certain geometrical 
facts concerning the pentahedral division of the half-space which will be 
frequently used in the following pages. 

A plane, ¢ = h, parallel to the z-plane is transformed by a transformation 
of the group, 2’ = (az + 8)/(yvz + 6), into a sphere tangent to the z-plane. 
We get its equation from the inverse of equation (6). It is 


YY — zya — Fa t+ aa’ 


Since p? = 22 + ¢, this can be written 


a @ 1 ¥ 1 


From this we see that the center of the sphere is the point z = a/y, ¢ = 1/2hy7, 
and that its radius is 1/2hyy. The sphere is thus tangent to the z-plane at 
the point z = a/y. 

There are two planes of particular importance of the form ¢ = h. One of 
these is ¢ = 3 V2, which contains the vertices of all pentahedra with peaks 
at o. By the transformation mentioned above this becomes a sphere of 
radius 1/ v2y7 tangent to the z-plane at z = a/y. We shall call this sphere 
(a/v). The transformation in question carries z= © to 2 =a/y. 
The part of the plane above ¢ = } v2 is carried into the interior of = (a/y). 
Hence all the pentahedra with peaks at z = a/y lie within the sphere = (a/y) 
and their vertices lie upon its surface. 

A second important plane is £ = 1, which is tangent to the bases of all the 
pentahedra with peaks at «. It is transformed into a sphere of radius 1/2yy 
tangent to the z-plane at a/y. This sphere we shall call S(a/y). We see 
then that the bases of all pentahedra with peaks at z = a/y are tangent to the 
sphere S(a/y). 

It is well known that if p and q are any two complex integers without a 
common factor, other two complex integers, 6 and 6, can be found such 
that pi — Bg =1. Thenz’ = (pz + B)/(qz + 4) isa transformation of the 
Group of Picard which carries z = «© toz’ = p/q. Hence, each rational point, 
p/q, of the z-plane is a peak for infinitely many pentahedra. The vertices of 
these pentahedra lie upon the surface of = (p/q) and their bases are tangent to 
S(p/q). 

We shall find it convenient to represent the point « by 1/0. For the 
sake of uniformity we shall designate the planes ¢ = 1 and ¢ = 32 by 


q 
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S(1/0) and (1/0) respectively. The space above these planes corresponds 
to the interiors of the other S- or =-spheres. 

Consider now S(p/q). Its diameter is1/qqg. The sphere S (p/q) will lie 
below ¢ = 1 in general, but will be tangent if gg = 1. In this latter case 
p/q = 8, a complex integer, and we know that a pentahedron with peak at s 
and another with peak at » have a base in common. ‘The point of tangency 
lies in the common base. Since ¢ = 1 can be transformed into any other S- 
sphere we have the result: 

The sphere S(p'/q’) will be tangent to the sphere S(p/q) uf a pentahedron 
with peak at p’/q’ has a base in common with a pentahedron with peak at p/q; in 
all other cases the spheres are entirely exterior to one another. 

The diameter of =(p/q) is v2/qqg. If qq =1, so that p/q = 3/1, the 
center is atz = s, f= 1/ 2. The center thus lies in the plane ¢ = 142, 
and {(s/1) and =(1/0) cut orthogonally. Again, if gq = 2 (that is, g = 
+1-+7) the diameter of =(p/q) is and =(p/q) touches 2(1/0). 
By a simple division we can put p/q in the form s+ 1/(1 —7), or s+ 
(1+72)/2, wheres is an integer. The point of tangency is z = s + (1+7)/2, 
t=} 42, a vertex of certain of the pentahedra with peaks at o. It follows 
that in this case certain pentahedra with peaks at p/q must also have vertices 
at this point of tangency. If gq > 2, =(p/¢q) lies entirely below ¢ = 3 v2. 
Since ¢ = }2can be transformed into any other =-sphere we have the result: 

The sphere = (p’/q' ) will intersect the sphere = (p/q) orthogonally if a penta- 
hedron with peak at p’/q' and a pentahedron with peak at p/q have a common 
base; = (p'/q’) will touch = (p/q) uf a pentahedron with peak at p’/q’ and a 
pentahedron with peak at p/q have a common vertex but no common base; the two 
2-spheres will be entirely exterior to one another in all other cases. 

The analytical condition for a common base is expressed in the following 

THEeoreM. The necessary and sufficient condition that a pentahedron with 
peak at p’/q’ and a pentahedron with peak at p/q (the fractions being in their 
lowest terms) have a common base is that pq’ — p’q = + lor +1. 

If two such pentahedra have a common base, let 2’ = (az + 8)/(yz + 6) 
be the transformation of the Group of Picard carrying the fundamental 
pentahedron into the one with peak at p/qg. Then the pentahedron with 
peak at the origin and with the same base as the fundamental pentahedron is 
carried into the one with peak at p’/q’. But z = © and z = 0 become re- 
spectively a/y and 8/6, and the relation ad — By = 1 shows that a and 7, 
also 8 and 6, are without a common factor. Since p/q = a/v, p’/q’ = 6/6, 
we have p = ea, q = ey, p’ = 18, 7 = 76, where € and 7 are of the form 
+lor+z7. Then 


py — = en(ad — By) = +i. 


The condition is thus necessary. 
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It is also sufficient. For if pq’ — p’¢ = +1 or +7 the transformation 
= (pz + ep’)/(qz + eq’), where = +1 or ¥ respectively, has deter- 
minant 1 and is thus of the Group of Picard. It transforms « and 0 into 
p/q and p’/q’ respectively. Now a pentahedron with peak at © and one 
with peak at 0 have a common base, and these are carried into two with a 
common base and with peaks at p/q and p’/q’. The theorem is thus estab- 
lished. 

5. The calculation of the fractions of Hermite. According to the interpre- 
tation (Section 3) the calculation of the fractions reduces to the problem of 
finding the z-coérdinates of the successive peaks of the pentahedra through 
which the line z = w passes. We observe that two pentahedra with a common 
lateral face have the same peak; hence a moving point on the line z = w can 
pass from one pentahedron into another with a different peak only by passing 
through the common base of the two pentahedra.* An application of the 
theorem at the close of the preceding section gives the following result: 

If p’/q’ and p/q are successive fractions of Hermite in the development of a 
number w, py’ — = +lor+i.t 

By multiplying p and q by a suitable one of the numbers + 1, + 7, we can 
have pq’ — p’ q equal always to any one of the four quantities desired. The 
method of calculation to be given yields always pq’ — p’ g = — 1, where p/q 
is the fraction following p’/q’. 

The point z = w, ¢ = k lies in a pentahedron with peak at ~ if k is large. 
The first fraction is then 1/0, whatever the value of w. The pentahedra with 
bases in common with those with peaks at ~ have their peaks at the complex 
integers, and the common bases project orthogonally upon the z-plane into 
unit squares whose sides are parallel to the coérdinate axes and whose centers 
are the integers. The second peak is then at a, the nearest complex integer to w; 
that is, an integer such that the real part and the imaginary part of w — a are 
each less than or equal to 3 in absolute value. Writing the second fraction 
a/1 

We set now the following problem: 

Given two successive fractions of Hermite, p'/q’ and p/q, in the development 
of a number w, where py’ — p’q = — 1, to find the following fraction, P/Q. 

The geometrical interpretation of the problem is shown in Figure 1, p. 10. 


* If the line passes through a vertex of the first pentahedron the moving point may pass 
into a pentahedron having only a vertex in common with the first. But the vertex belongs 
also to other pentahedra, one of which has a base in common with the two preceding and can 
be taken as intermediate between them. 

This case does not arise if w is irrational, the z-coédrdinate of a vertex being always 
rational. For the vertex is the transform of one of the vertices of the fundamental pen- 
tahedron, z = (+1-+i)/2, ¢ = 32, and we see from the second of equations (5) that 
with these values of z and ¢, z’ is rational. 

t This result appears in Hermite’s memoir. 
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As k decreases the moving point in the line z = w passes from a pentahedron 
D, with peak at p’/q’ , traversing its base, and entering a pentahedron D2 with 


Fic. 1 


peak at p/q. After passing through a certain number of pentahedra with 
peaks at p/q it passes from one of them, D;, by its base and enters a penta- 
hedron, D,, with its peak at P/Q, the fraction sought. 

Let us apply to this configuration the transformation 


qz— p’ 
(18) 


which, since its determinant is 1, is of the Group of Picard. The transformed 
configuration is shown in Figure 2, D;, D:, D3, Di being the transforms of 


| 

| 

| 
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D,, De, D3, Ds. The fractions p/q and p’/q’ become respectively ~ and 0. 
The pentahedron D; is then the fundamental pentahedron, and D; is the 
pentahedron with the same base. The pentahedron Dj also has its peak at 
co , and D;, which has the same base, has its peak at a complex integer s, to 
be found. 


Writing «’ and w’ for the transforms of « and w, we have 


where u = q(qw—p). The half-line z = w is transformed into the semi- 
circle C orthogonal to the z-plane through these two points. (This follows 
from the fact that circles are transformed into circles and that angles are 
preserved.) In tracing this semicircle from «’ to w we enter the pentahedra 
with peaks at « by the base of D; and leave them by the base of D3. 

The circle C has its center in the z-plane at z = $( ©’ + w’) = q’/q — 1/2u, 
and its radius is }| ©’ — w’|, or 1/|2u|. Its equations can be written 


4uii’ 
(20) 


where ¢ is a real parameter. The first of these equations is the sphere with 
the same center and radius as C; the second is the plane through ©’ and w’ 
orthogonal to the z-plane. The parameter ¢ is positive for points between «’ 
and the highest point of C and negative for points between the highest point 
and w’. 

The base common to D3 and Dj is the unit sphere with center at the point 
z = s in the z-plane, 


(21) +P =1. 


The z-coérdinate of the point of intersection of the circle (20) with the 
sphere (21) we find after an easy calculation to be 


1 
1 1 


u q wa ° q 2u 


In order that the point of intersection lie in the base of D3, 2; must lie within 
the unit square whose center is s and whose sides are parallel to the codrdinate 
axes. This condition is also sufficient to determine s if we rule out the case 
s = 0, which is the peak of D;. Otherwise stated, s is the complex integer 


1 t 
q 2 u 
| 
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different from zero such that z; — s has a real and an imaginary part each less 
than or equal to } in absolute value. Putting U = q’/q — 1/2u, we can write 
z; — 8 in the following form 
(23) y —, 9) (0-5) wh 
u(uU + aU) 

Having found s to satisfy the above condition we can transform back to the 
original configuration by means of (18), i. e., 2 = (pZ — p’)/(qZ —q’). 
Hence we have 


(24) 


and we can take 
(25) P = ps — p’, Q =qs 


P_ps—p' 
gs—q 


We note also that Pg — pQ = — 1. 

The determination of s from the expression (23) is not practicable, and a 
method will now be devised in which it will be necessary to have recourse to 
(23) only in certain doubtful cases. 

The z-coérdinate of the point of intersection of the semicircle C with the 
plane ¢ = 3 V2 (which passes through the vertices of the pentahedra with 
peaks at « ) will differ but little from that of the intersection with the base 


sought. Combining the equations of (20) and putting ¢? = 4 we have 


whence 


t= +41 —2ui. 


The negative value of ¢ gives the intersection nearest w’. From the second 
of equations (20) we have for zo, the intersection sought, 

(26) + vi — Qui). 

Let 8 be the integer nearest 29. Under what conditions is 9 the integer s 
of condition (23)? The semicircle C either intersects the base B lying above 
zo, or intersects an adjoining base and passes under one of the sides bounding 
the base before reaching the plane ¢ = }v2.* Hence s is one of the integers 
89, 81, 82, Where 8; and 82 are the peaks of two of the pentahedra with bases ad- 
joining B. Since C passes through the base above the origin before arriving at 
the bases just mentioned, |s,| and |s2| are less than |s9|. Also |s; — s| = 1 
and |s2 — | = 1, since the bases above s; and s2 adjoin B. (That is, one has 


* This latter possibility renders the calculation of the fractions a more complicated pro- 
cedure than in the case of reals. 
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a real part, the other an imaginary part, differing by one unit from that of 89. ) 
Now the semicircle C can intersect each sphere with center in the z-plane 
but once. Hence if C intersects the base above s:, say, the later point of 
intersection of C with the plane ¢ = 3 V2 must still be within the unit sphere 
with center s;, which forms that base. We find easily that ¢ = 3 V2 intersects 
this sphere in a circle of radius 142. Hence if (zo — 81) (Zo — #1) 23, C 
cannot have intersected the base above s;. 

We can now state the following rule of calculation, which would appear to 
be in as convenient a form as possible for actual use. 

Let so be the nearest complex integer to the number zo, where 

1 

(27) % = — 5, (1 + V1 — 2ui). 

Determine the complex integer s as follows. Let 81, 82 be the two (in general 
two) complex integers such that 


(28) lso—si| =1, < (¢ =1,2). 
If neither of the inequalities 


(29) (z — (Zo — 81) <3 (¢ =1,2), 
is satisfied, s = 8. 

If the inequality is satisfied for i = 1 (or 2),* substitute 8, into the expression 
(23). If this expression has a real and an imaginary part each less than or equal 
to 4 in absolute value, s = 89; uf not, s = 8; (or 82). 

Then 
(30) P=ps—p', 

Beginning with the two fractions 1/0 and a/1 and applying this method 
of procedure, we can calculate step by step the fractions of the suite. 

In applying this process it is not necessary in the majority of cases to have 
recourse to condition (23). If such recourse is necessary there is recompense 
sometimes in the fact that P,/Q,, the fraction following P/Q, can be found at 
once. For if s = s; (j = 1 or 2) in the preceding rule, we know that the 
next peak determined by C is at 89, for the point of intersection of C with the 
plane ¢ = } v2 certainly lies in a pentahedron with peak at so. Therefore, 
if P/Q = (ps; — p’)/(qsi — 9’), we have P;/Q: = (ps — p’)/(q80 — 9). 
This yields the result 

pso — p’ qso — 
(31) Pi = = 
the denominators, which are of the form + 1, + 7, being introduced in order 
to make — PQ, = 1. 


* The inequality cannot be satisfied for both s; and s2, since | s: — s2| = v2. 
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6. The modified hermitian suite. The preceding rule of calculation gives 
all of the fractions of Hermite in the development of a number w. We get a 
suite of fractions differing little from the true suite if in the preceding process 
we take always s = s). We thus dispense altogether with condition (23), 
and find P and Q directly from (27) and (30). As we have just seen this 
usually gives the fraction immediately following p/q, but in some cases it 
yields the second fraction after p/qg. ‘This modified suite thus differs from 
the original by the omission of certain fractions, but no two successive fractions 
of the original suite are omitted. 

It is easily seen that a fraction can be omitted only if the line z = w passes 
through but two pentahedra with peaks at the fraction, but that it is not always 
omitted under these circumstances. 

7. Properties of the fractions of Hermite. Let p/q be a fraction in the 
suite determined by the numberw. Then the line z = w intersects pentahedra 
with peaks at p/qg. As these pentahedra all lie within the sphere =(p/q), 
which has the radius 1/ ¥2qq, we have at once the inequality (2) of the Intro- 
duction* 


Pp 
32 
(32) ie q | 


v2qq° 
Since p/q is the minimum of the form (1) for a certain value of & the repre- 


sentative point of the form, z = w, ¢ =k, lies in a pentahedron with peak 
at p/q, and therefore lies in 2(p/q). This gives the condition 


= 


(33) (p — wq)(p — 07) +P Ske. 


This is the inequality from which Hermite, in the memoir cited, derives his 
results, and we thus see its geometrical meaning. 

We turn now to a comparison of p/q with the fraction P/Q which im- 
mediately follows it in the suite. 

Comparison of Q and q. Let M be the sphere forming the common base of 
the two pentahedra with peaks at p/q and P/Q. The line z = w, traced 
from © to the z-plane, enters this sphere through the common base, passing 
out of a pentahedron with peak at p/q and entering one with peak at P/Q. 
The point p/q of the z-plane is outside M while P/Q and also w are inside. 
Now let us consider S(p/q) and S(P/Q), which are tangent to the bases 
of all pentahedra with peaks at p/q and P/Q respectively. Since the one 


IIA 


* If w is irrational the sign of equality can be removed. For the only points of the penta- 
hedra on the equator of = (p/q) are vertices, and these have rational z-coérdinates. A 
similar remark is applicable to the next inequality. See Note, Section 5. 
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set of pentahedra can be derived from the other by an inversion in the common 
base, it follows that S(p/q) and S( P/Q) are inverse spheres with respect to 
M. The former lies outside M and the latter inside, and they are tangent 
at a point in the common base. The former sphere is therefore larger than 
the latter. Using the values of the radii found in Section 4, we have 1/2qq 
> 1/2QQ; or 

(34) > 


The convergence of the suite to the value w now follows at once. If pn/qn is 
written as the general term of the suite, (34) shows that |q,| approaches infinity 
as n increases without limit. The convergence then follows from (32). 

In order to examine more closely into the relation between Q and q let us 
find the equation of the sphere M. Any base is the transform of the base 
of the fundamental pentahedron, 22 + ¢? = 1, or p? = 1, by some transforma- 
tion of the group, z’ = (az + B)/(yz +6). We get the transform at once 
from the first of equations (5), using the inverse transformation, 


(22 + §*) 58 — 288 — 268 + BB 


(8% +) yy — 2ya — Fa t+ aa’ 


This can be written in the following form: 


65 — 65 — (65 — yy)? 
We see that this sphere has its center at the point z = (85 — ay)/(65 — yy) 
and that its radius is 1/|65 — y7|. 
Now the transformation carrying the base of the fundamental pentahedron 
into the base common to two pentahedra with peaks at p/q and P/Q is 
2’ = (Pz — p)/(Qz — q). Hence we find that the center of M is 


z = (PQ — pq)/(QQ — 


and its radius is 1/(QQ — qq). This may be stated as follows: 

If R is the radius of the sphere forming the common base between pentahedra 
with peaks at two successive fractions of Hermite, p/q and P/Q, Q and q satisfy 
the equation 


(36) QQ = 49 + 1/R. 

Representing the successive fractions in the suite by po/qo, pi/q1, °°; 
Pn/Qn, ***, we shall now prove the following: 

Writing Qn+1 Inti = Qn In + Nu, then 
(37) Lim N, = ©. 


To prove this it is sufficient, since N, ‘s an integer, to establish that there 
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are only a finite number of values of n yielding a given value N of N,; that 
is, that the line z = w can pass through only a finite number of spheres of the 
system forming bases whose radii are 1/N. The point w is within all these 
spheres, hence the spheres intersect. A reference to the fundamental penta- 
hedron shows that two spheres of the system dividing the half space into 
pentahedra intersect always at an angle 7/3 or 7/2. The equators of the 
spheres in the z-plane are equal circles mutually intersecting at these angles, 
and enclosing the point w. Obviously only a finite number of such circles 
exist, and the theorem is established. [A simple geometrical construction 
shows that not more than four equal circles enclosing a point and mutually 
intersecting at one or other of the given angles exist; hence not more than 
four values of N, are equal.] 

Comparison of P and p. If we make the transformation z’ = 1/z (which 
belongs to the group, for in writing it z’ = 7/iz it has the determinant 1), 
p/q and P/Q are transformed respectively into g/p and Q/P. The spheres 
S(p/q) and S(P/Q) become S(q/p) and S(Q/P) respectively. These 
latter spheres are tangent to and inverse with respect to M’, the transform 
of M, one lying within M’ and the other without. 

The transformation z’ = 1/z in the plane is geometrically equivalent to an 
inversion in the unit circle 22 = 1 followed by a reflection in the real axis. 
Then the corresponding space transformation is equivalent to an inversion 
in the unit sphere 22 + ¢° = 1 followed by a reflection in the plane n = 0. 
The latter transformation has no effect upon the dimensions of the figure. 

We consider three cases according as (a) the origin is without M, (b) the 
origin is within M, or (ce) the origin lies upon the surface of M. 

(a) If the origin is outside M the inversion in the sphere 22 + °° = 1, 
whose center is the origin, followed by the reflection mentioned above, trans- 
forms the interior and exterior of M into the interior and exterior respectively 
of M’. The sphere S(Q/P) is then inside M’ and S(q/p) is outside, 
and the latter is thus the larger. Their radii then give the inequality 
1/2pp > 1/2PP, or 
(38) |P| > |p]. 


(b) The interior of a sphere enclosing the pole of inversion is transformed 
into the exterior of the inverse sphere; hence in this case the interior and 
exterior of M become the exterior and interior respectively of M’. The 
sphere S(Q/P) is then outside M’ and S (q/p) is inside, and the former is the 
larger. Whence 1/2PP > 1/2p@, or 


(39) [P| < |p|. 


Now the only sphere of the system dividing the half-space into pentahedra 
which has the origin on its interior is the unit sphere z2 + ¢° = 1 itself. The 


i 
‘ 
| 
ul 


1918] IRRATIONAL COMPLEX NUMBERS 17 


surface of this sphere (as of any sphere of the system) is divided by the other 
spheres of the system into infinitely many regions. Of these regions an infinite 
number are bases of pentahedra and an infinite number are lateral faces. If 
the line z = w intersects the sphere in one of the bases there is a fraction P/Q 
of the suite for which |P| < |p|. Since the line can intersect the sphere 
but once there can not be more than one such fraction. If |w| = 1 there is 
obviously none. 

The points P/Q and p/q are inverse with respect to 22 = 1, the equator of 
M. Writing the equation of the inversion z’ = 1/2 we have P/Q = q/p. As 
the fractions in this equation are in their lowest terms P = eq, Q = ej, where 
e=+1lor+z2. The relation Pq — pQ = — 1 gives at once the following 
results: 

(40) Q=p, PP=pp-1, QQ=@+1. 

We shall now show that this exceptional fraction can be as far from the 
beginning of the suite as is desired. 

Given an integer n however large, a number w can be found such that in its 
hermitian suite, po/qo, Pi/qi, ***s |Pm+i| < |Pm|, where m is some integer 
greater than n. 

Let w’ be an irrational number on the circumference of the circle 22 = 1, 
and let po/qo, -*:, Pn/Qn, *** be its suite of Hermite, which contains an 
infinite number of fractions. The line z = w’ intersects only pentahedra 
outside the sphere 22 + ¢? = 1. Nowa’ can be varied slightly in any direction 
without altering the fact that the line z = w’ intersects pentahedra with peaks 
at po/qo, ***, Pn/qn (a finite number). The bases lying on the sphere 
22 + ¢? = 1 cluster in infinite number about each point of the equator. If 
we put w’ = w, where w is within the limits of variation allowable and where 
the line z = w intersects one of these bases, the suite for w has an exceptional 
fraction for which | pmii| < |pm|. As the pentahedra with peaks at Pmi1/Qm+1 
lie within the sphere, this fraction is subsequent to p,/qn, whose pentahedra 
lie without, and the theorem is established.* 

(c) If M passes through the origin, M’ is a plane. The sphere S(Q/P) is 
then the reflection of S(q/p) in this plane, and the two spheres are equal. 
Whence 1/2PP = 1/2pp, or 
(41) |P| = |pl. 

There are infinitely many spheres of the system passing through the origin. 
They are the spheres which result from inverting the planes of the system 
inthe spherezz + = 1. 
The line z= w can be chosen to intersect as many of these spheres as is desired, 
but we shall see that the number that can be intersected in bases is limited. 


* This result is in striking contrast with that in the case of reals, where such a fraction can 
appear only as the second fraction of a hermitian suite—thus 1/0,0/1, ---. 
Trans. Am, Math. Soc. 2 
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It will be easier to consider the planes into which these spheres are trans- 
formed. By the transformation z’ = 1/z the half-line z = w is carried into a 
semicircle through the origin and the point 1/w. By taking 1/w sufficiently 
large the semicircle can be made to intersect as many of the planes as is desired. 
But the portions of these planes outside the unit spheres of the system are 
lateral faces of pentahedra with peaks at « , hence the bases lying in these 
planes are all within these unit spheres. Each of the unit spheres, which 
has its center always at a complex integer, intersects four planes of the system. 

Now if 1/w lies within one of the four unit spheres passing through the origin 
the entire semicircle lies within that sphere and can intersect only the four 
planes intersected by that sphere; and, since it can intersect each plane but 
once, the number of plane bases traversed is not more than four. If on the 
other hand 1/w lies outside one of the four unit spheres through the origin 
the semicircle enters the fundamental pentahedron by its base before inter- 
secting any of the planes. The semicircle then crosses a number of plane 
lateral faces of pentahedra and eventually passes out of the pentahedra with 
peaks at « by the base of one of them. It can then intersect only the planes 
lying in the sphere forming that base, which are four in number. We have 
then the rough result that not more than four plane bases can be intersected 
by the semicircle; hence, the number of terms for which |P| = |p| is not 
greater than four.* 

The following two examples illustrate the three cases just discussed. 

Suite for w = 35: 


10 -1 -125 
i 450 + 307 
Suite for w = 451 
11-1415 -15 (1-301 450+ 30i 


Value of the fractions as approximations to w. It is easy to show that the 
distance of the approximating fraction from the limit w tends steadily to 
zero; that is, that each fraction of the suite is nearer w than the preceding 
fraction. For we have noted the fact that P/Q and p/gq are inverse points 
with respect to the circle in which the sphere M cuts the z-plane. The point 
P/Q lies inside the circle, and is then nearer to any interior point of the circle 
than is p/q. Since w is within the circle, P/Q is nearer w than is p/q. 

This fact follows also from the following important theorem due to Hermite: 
it is easy to show that this number is not greater than three, but we omit the proof of this. 


We can show that one of these fractions can be as far from the beginning of the suite as 
desired, but that any others must remain near the beginning. 
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TuHEeorEM. If p/q is a fraction in the suite for w and r/s is any other fraction 
for which |s| = |q|, then 

q 


(42) 


This theorem is susceptible of a geometric proof, but it lacks the simplicity 
of the following proof given in Hermite’s memoir: Since z = p, y = q is the 
minimum of the form (1) of the Introduction for some value k; of k, we have 


(p — (p — + ki gg < (r — ws) (7 — ws) + kiss. 


This inequality holds for all integral values of r and s different from p and q. 
If now |s| = |q|, ki qq = ki ss, and therefore 


(p — wq)(p — aq) < (r — ws) (7 — as) 


which yields at once the inequality (42). 

There is thus no better rational approximation to w than a given fraction 
of Hermite without having a larger denominator. Since |q| < |Q| this 
shows that P/Q is a better approximation than p/q. 

The fraction preceding p/q. Let us consider all numbers whose suite of 
Hermite contains the fraction p/q, and let us fix our attention upon the 
fraction immediately preceding p/q in each suite. How many different 
fractions appear? The discussion of this problem sheds an interesting light 
upon the geometry of the pentahedral division of the half-space. Since the 
line z = w enters a pentahedron with peak at p/q from a preceding penta- 
hedron by the common base, the number of different fractions is equal to the 
number of the bases of pentahedra with peaks at p/q which are turned upward. 
It is also easily shown to be the number of bases whose points of tangency with 
the sphere S(p/q) lie above the horizontal plane, ¢ = 1/2qq, through the 
center of that sphere. 

Let po, Yo be integers satisfying the equation pgo — poqg = —1. Then 
the transformation of the Group of Picard 


q2 — Yo 
transforms the pentahedra with peaks at © into those with peaks at p/q. 
The pentahedra with bases in common with the preceding have their peaks 
at the complex integers. Hence, on transforming, the pentahedra with bases 
in common with those with peaks at p/q have their peaks at (ps — po)/ 
(qs — qo), where s is given all complex integral values. One of these fractions 
can precede p/q if and only if it has a smaller denominator; that is, if 


(43) z 


las —qo| <|q|, or  |s—4qo/q| <1. 
This leads to the following 


a 
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THeoreM. Let po, qo be integers such that pqog — pog = —1. Then the 
number of different fractions that can immediately precede p/q in a suite of 
Hermite is equal to the number of complex integers enclosed by a unit circle in the 
z-plane whose center 1s qo/q. 

If 81, 8, +++ are the values of the complex integers within this circle, then the 
fractions which can immediately precede p/q are 


Psi — Po P82 — Po 
q81 — qo’ q82 — qo’ 


In Figure 3 a unit square with vertices at complex integers is divided into 
regions by unit circles with centers at the vertices. A number has been 
affixed to each region such that if qo/q lies within 
> the region the number indicates the number of 
complex integers lying in the unit circle with go/q 
as center. If qo/q lies at a vertex of the square 
there is but one integer within the circle (in this 
case, since go/q is an integer, |q| = 1, and p/q is 
an integer), otherwise there are two, three, or four. 
3 3 If p and q are real, po and go can be chosen to be 
real, and qo/q lies upon the side of the square of 
the figure. We observe that in this case the num- 
ber of integers enclosed is in general two, there 
being thus two pentahedra with bases turned upward. Now the plane 7 = 0 
cuts the pentahedra with peaks on the real axis in a configuration which is 
precisely the classic modular division of the half-plane. We can then make 
the following statement: 

In the modular division of the half-plane two and only two triangles with their 
peaks at a point p/q have their bases turned upward, unless p/q is an integer, 
when there is but one. 

Pentahedra with plane bases. Of many other geometrical considerations 
amenable to the methods of the preceding paragraphs* we shall treat one— 
to determine the number of bases of the pentahedra with peaks at p/q which 
are plane. The condition that the base common to pentahedra with peaks 
at p/q and p’/q’ be plane is that S (p/q) and S (p’/q’) be equal in magnitude, 
for these spheres are inverses with respect to the common base. Since q’ is 
of the form gs — qo, this requires that |gs — qo| = |q|, or |s — qo/q| =1. 
That is, s is a complex integer on the circumference of the unit circle whose 
center is go/q. 


Fig. 3 


*We can show, for example, that the number of bases whose points of tangency with 
S (p/q) lie above a plane ¢ = h is equal to the number of complex integers lying within a 
circle whose center is go / q and whose radius is a simple function of h. Also, the number of 
vertices above that plane is equal to the number of mid-points of squares such as Fig. 3 within 
a suitable circle with qo / q as center. 
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Referring to Figure 3 we see that if qo/q is an integer (in which case p/q is 
an integer) there are four integers on the circumference. There are no points 
of the figure on three circles. There are points lying on two circles, but we 
find on solving that these points are irrational, and hence are impossible 
values for go/q. There are, however, infinitely many rational points on one 
circle. Whence, of the bases of pentahedra with peaks at p/q, four are plane 
af p/q is a complex integer, otherwise not more than one can be plane. 

8. Condition that p/q be a fraction in the suite for w. We propose in this 
section to find necessary and sufficient conditions that a given fraction should 
belong to the hermitian suite of a given number without the necessity of 
calculating the suite. That is, we shall find the condition that the line z = w 
should intersect pentahedra with peaks at p/q. If the line does not intersect 
the sphere = (p/q) within which the pentahedra lie, p/q is not a fraction of 
the suite. If the line intersects S (p/q), which lies entirely within the penta- 
hedra in question, p/q certainly is a fraction of the suite. These facts give 
the following statement: 

A necessary condition that p/q belong to the suite for w is that 


a sufficient condition is that 


The case in which z = w intersects = (p/q) but does not intersect S (p/q) 
remains in doubt and requires a test of much greater nicety. If p/q is a frac- 
tion of the suite the line z = w must pass through one of the bases of the 
pentahedra which are turned upward. If these bases are projected orthog- 
onally upon the z-plane w must lie in the resulting figure. Now the boundaries 
of these bases are circles orthogonal to the z-plane, and they consequently 
project into straight lines. The upturned bases then project into a rectilinear 
polygon. This polygon can be constructed when we know the projections, 
that is, the z-codrdinates, of the vertices of the bases. 

We see from the preceding section that these upturned bases are the trans- 
forms of the bases of pentahedra with peaks at © which lie above the complex 
integers 8;, 82, --+ which lie within the unit circle with center at qo/q, where 
the transformation to be applied is 2’ = (pz — po)/(qz — qo). Now the 
z-codrdinates of the vertices of the base above the integer s are the z-codrdinates 
of the vertices of a unit square whose center is s and whose sides are parallel 
to the real and imaginary axes, and the ¢-codrdinate is }¥2. Using the 
second of equations (5) and putting z = z;, ¢ = } v2, we get on simplifying 
2 (2: — qo/q) 


(44) — 90/4) — 90/4) + 11 
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We can now state the following 

THEOREM. Let po, qo be integers such that pqo — pog = — 1, and let 3, 
82, ++ be the complex integers lying within the unit circle whose center is qo/q. 
Let unit squares with sides parallel to the real and imaginary axes and with centers 
at 81, 82, «++ be constructed. As we pass around the perimeter of the connected 
figure thus formed, let z,, z2., «++, 2n be the z-codrdinates of the successive vertices 
encountered. 

From the values z;, «++, %n determine z;, ---, 2, by means of equation (44), 
and form a rectilinear polygon in the z-plane by joining 2, 2,, 2%, ***, 2,2;+ 

Then the necessary and sufficient condition that p/q be a fraction in the suite 
for w is that w lie within or upon the boundary of this polygon. 

We see from the inequalities at the beginning of this section that this 
polygon will lie within the circle |z — p/q| = 1/ V2qq, and that it will enclose 
the circle |z — p/q| = 1/2qq.* 

The condition that po/qo and p/q, where pqo — pog = — 1, are successive 
fractions in the suite for w, is that the line z = w should intersect the base 
common to the two pentahedra with peaks at po/qo and p/q. This base 
projects into a quadrilateral in the z-plane. Furthermore this base is derived 
from the base of the fundamental pentahedron by means of the transformation 
z’ = (pz — po)/(qz — qo). The z-codrdinates of the vertices of the funda- 
mental pentahedron are (+1-+7)/2. From these four values we get 
Z,, 2,» 23, 2, from (44). Then the necessary and sufficient condition that po/qo 
and p/q be successive fractions in the suite for w is that w lie within or on the 
boundary of the quadrilateral zz, 


‘ PART II 


REGULAR CONTINUED FRACTIONS 


9. Preliminary account. The continued fraction 


€1 

Oo + 
b+ 

by 
bs 
or as commonly written 
€ € € 


|b) |bo + [bs 
* If z, is the vertex of this polygon which is farthest from p/q, we have, when p/q is a 
fraction in the suite for w, 


This is generally a stronger inequality than (32). I find that | z, — p/q| can be as small as 
but never smaller than 2 / 3qq. 
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will be called regular if «, = + 1 or +7 and Jb, is a complex integer.* It is 
clear that by multiplying each ¢, and the corresponding denominator by a 
suitable one of the numbers + 1 or + 7 the continued fraction can be put in 
the following form: 


(45) — 


— | — 


where s, (= + b, or + ib,) is a complex integer, and it is this form that we 
shall constantly employ.t 


1 
a — — 
called the nth convergent, and will be represented by p,/q,. If the fraction 
terminates with s, its value is Pn41/qn41. If it does not terminate it is said 
to converge if p,/qgn approaches a limit as n becomes infinite; and the value 
or sum of the fraction is defined to be that limit. 

The first convergent p;/q: is 89, the second po/q2 is (80 81 — 1)/si1. Putting 
Pi = 8, Gi = 1, po = 8981 — 1, Go = 81, we note that poqi — pige = — 1. 
Proceeding from these two convergents, all succeeding convergents can be 
calculated by means of the recurrence formule 


(46) 


The value of the fraction when Is - is set equal to zero is 


= Pn Sn — Pn-15 Qn+1 Qn — Qn-1> 


and the relation 
(47) Pati dn — Pn = — 1 

always holds. We note that the convergents are all rational fractions in their 
lowest terms. 

If we put po = 1, go = 0, we observe that the formule (46) for n = 1 give 
p2 and gz. We shall then adjoin to the convergents defined above the con- 
vergent po/qo = 1/0. 

The continued fraction of the suite of Hermite. We can set up at once a regu- 
lar continued fraction whose convergents are the fractions of a suite of Hermite. 
The first two fractions of the suite we found to be 1/0 and a/1. In deter- 


* The continued fractions involving complex integers have been little studied. Only one 
kind of such fraction has, so far as I know, appeared in the literature. See Hurwitz, Acta 
Mathematica, vol. 11 (1887), pp. 187-200; Auric, Journal de mathéma- 
tiques, 5thser., vol. 8 (1902), pp. 387-431. 

t The fraction with positive signs 


bo + 


+ + [bs + 
can be put in the form (45) as follows: 


1| 1| 1| 


~ = Th 


bo 


that is, by merely changing the signs of b:, bs, bs, «+: 
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mining a fraction P/Q in terms of the two preceding, p’/q’ and p/q, an integer 
s was found and the formule (30) used: 


P=ps—p’, Q=q-q7. 


These formule are identical with (46). Hence, if we put s) = a and let 
$1, 82, 83, *** be the integers found at each step of the calculation of the 
fractions, the fractions of the suite are identical with the convergents of the 
fraction (45). 

10. A geometrical interpretation. By means of the S-spheres (Section 4) 
we can give the continued fraction a geometrical interpretation. Equation 
(47) shows that S(pn41/qn4i1) and S(pn/qn) are tangent. The convergents 
of a regular continued fraction, 1/0, 80/1, p2/q2, p3/qs, -**, determine a suite 
of S-spheres, S(1/0), S (80/1), S(p2/q2), «++, of which the first is the plane 
¢ = 1, and such that each sphere of the suite is tangent to that which precedes it. 

Conversely, if S(1/0), S( 80/1), S(p2/q2), «++ any suite of S-spheres 
(the first sphere being § = 1) such that each is tangent to that which precedes it, 
then the points of tangency of the spheres, 1/0, 80/1, po/qe, «++, are the con- 
vergents of a regular continued fraction. 

To prove the converse we note that the tangency of the S-spheres (see the 
theorem at the end of Section 4) requires that pr qn-1 — Pn-1 dn = £1, OF 


+i,n=1,2,3,--+-. Beginning with the second term of the suite we can 
multiply numerator and denominator by a suitable one of the numbers + 1 
or +7280 that Qn-1 — Pn—-1 In = — 1 always. 


Now the transformation z’ = (paz — Pn—1)/(Gn2 — Qn-1) transforms ¢ = 1 
into S(p,/qn); and the spheres tangent to ¢ = 1, namely S(s,,/1), where m 
takes on all complex integral values, are carried into the spheres tangent to 
S(pn/qn). One of these, say S(s8,/1), is carried into S (pn+1/qn41) , Whence 


Qn+1 Qn 8n — Yn-1 
and we can take 


= Pn Sn — = Qn 8n — 


which are precisely the formule (46). We can determine s, in this manner 
for each value of n; and these values put into (45) give a regular continued 
fraction whose convergents are the z-coérdinates of the points of tangency of 
the suite of spheres. * 

We note that if s, = 0 the recurrence formule give pnyi/qn41 = Pn—1/Qn-13 
that is, S(p,/qn) is preceded and followed by the same sphere. Also there is 
an infinite convergent as often as S(1/0) appears in the suite of S-spheres. 


* We can in a like manner give an interpretation by means of =-spheres, the suite beginning 
with =(1, 0), or ¢ = 32, and each sphere intersecting that which presedes it orthog- 
onally. 


j 
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11. Second geometrical interpretation. Generating curves. We shall now 
introduce an interpretation which will be more useful for most purposes, an 
interpretation which is an immediate generalization of that given for the 
fractions of Hermite in Section 3. We shall treat only the case of convergent 
continued fractions. 

THEOREM. Let a moving point trace a continuous curve from a point above 
the plane £ = 1 to the point z = w in the z-plane. Let this curve lie entirely in 
the space above the z-plane except at the point of termination, and let it intersect 
no base more than a finite number of times. Then the z-codrdinates of the succes- 
sive peaks of the pentahedra through which the point passes are the convergents in 
order of a regular continued fraction whose sum is w. 

Conversely any regular continued fraction converging to the value w can be 
generated in this way. 

Since the beginning of the curve is in a pentahedron with peak at ~, the 
first convergent is 1/0 as required. Also since the moving point passes from 
one pentahedron into another with a different peak by passing through the 
common base, the condition that successive S-spheres be tangent is satisfied. 
(If the point passes through a vertex it becomes necessary to choose the next 
pentahedron properly in order that this be true.) 

As a first step toward establishing the convergence let us show that if P is 
any point on the curve, other than the point of termination, and p/gq is the 
z-codrdinate of the peak of the pentahedron in which P lies, there are only a 
finite number of convergents preceding p/qg. The portion of the curve pre- 
ceding P lies above some plane ¢ = ¢ > 0; and it is easily seen that in any 
finite region above such a plane there are portions of only a finite number of 
pentahedra. The proof of this is as follows. The sphere = (p,/q,) in which 
pentahedra with peaks at p,/qn lie has the radius 1/ V2q, Gn, and in any finite 
part of the plane there are only a finite number of fractions for which q, has a 
sufficiently small value that this sphere be large enough to intersect ¢ = e. 
Further, if the sphere is sufficiently large only a finite number of the penta- 
hedra with peaks at p,/q, are intersected by the plane. For a transformation 
of the form 2’ = (q’2 — p’)/(dn2 — pn) carries the pentahedra with peaks at 
Pn/Qn into those with peaks at ©, and carries ¢ = e into a sphere of radius 
1/2€qn Gn tangent to the z-plane at q’/q,. This latter sphere obviously inter- 
sects only a finite number of pentahedra with peaks at ~. Hence { = «€ 
intersects only a finite number of pentahedra with peaks at p,/qn. 

We conclude from the above that there are only a finite number of bases 
intersected by the curve before reaching the point P, and by hypothesis each 
is intersected only a finite number of times. As each passage through a base 
yields a new convergent, it follows that only a finite number precede the 
convergent p/g mentioned above. 


5 
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Secondly, we must show that as P approaches the termination of the curve 
the corresponding convergent approaches w. It has just been pointed out 
that the number of fractions in a finite region—in a given neighborhood of w 
for example—whose 2-spheres have a radius exceeding any small positive 
quantity, 7 say, is finite. Unless w is one of these rationals we can construct 
a sphere R with w as center of sufficiently small radius that it intersects none 
of these =-spheres. Then, if p/q is the peak of the pentahedron in which P 
lies, = (p/q) has a radius less than 7, when P lies within R. As P approaches 
w we can decrease the radius of R and likewise let 7 approach zero. Since 
= (p/q) always contains P and since the radius of 2 (p/q) approaches zero, 
the point of tangency, p/q, approaches w. 

If w = p’/q’, a rational, the above reasoning applies to all =-spheres except 
= (p’/q'), but as the point of tangency of this sphere is equal to w, we have 
convergence in this case also. 

To show that any convergent regular continued fraction can be defined in 
this way, we consider the suite of S-spheres, S(1/0), S(s0/1), S(p2/q2), 
--+, determined by the fraction. Let A be a point above the plane ¢ = 1, 
and join A to the point of tangency of S(1/0.) and S(8/1) by a straight 
line, then join this point to the point of tangency of S(s8o/1) and S ( pe/q2) 
by a straight line. Continuing this process we get a continuous curve com- 
posed of segments of straight lines. It is obvious that the convergents deter- 
mined by this curve are precisely those of the continued fraction. 

It remains to show that a point moving along the curve thus determined 
approaches w as its terminating value. If the fraction terminates we shall 
join the last point of tangency to the point w, the line thus lying entirely 
within the last S-sphere, and the condition demanded is obviously satisfied. 

If the fraction does not terminate its convergence requires that the de- 
nominator of the nth convergent p,/q, should approach infinity as n increases 
(excepting the case where w = p’/q’ and p’/q’ appears infinitely often as a 
convergent). Then the radius of the sphere S (p,/qn) approaches zero and a 
point on the straight line segment within the sphere approaches w. (If p’/q’ 
appears infinitely often this reasoning applies to all spheres except S (p’/q’). 
But p’/q’ can not appear twice in succession as a convergent and the segment 
within S(p’/q’) joins points of tangency of decreasing spheres and this part 
of the curve also approaches p’/q’ or w.) The theorem is thus established. 

12. Connection of the preceding with quadratic forms. Let the curve of 
the preceding section be represented in parametric form 


(48) F=f(t), 


the interval of the parameter being = t,. We have then f(t,) > 1, 
(te) + ide (te) = w, f(t) = 0. By referring to Section 2, we see that the 


i 
4 
i 
a 
4 


1918] IRRATIONAL COMPLEX NUMBERS 27 


point on the curve whose parameter is ¢ is the representative point of the 
definite hermitian form 


(49) {x — [gi (t) + tho (t) {% — [oi (t) — (t)] 9} + (f(t) P yg. 


The proposition of the preceding section can then be put in the following 
form: 

THEOREM. Let (po, 90), 91), be the successive complex integral 
values of (x,y) yielding the minimum of the form (49) as t decreases from t, 
tot.. Then po/qo, pi/q, +++ are the successive convergents of a continued fraction 
whose sum is w. 

The suite of Hermite is an example of this, in which ¢; (tf) + i¢2(t) = @, 
a constant, and f(t) =t. 

13. The convergent as an approximation to the sum. From a knowledge of 
the form of the generating curve we can usually determine the degree of 
approximation to the sum of the continued fraction which the convergents 
afford. For example 

If when t < t’ the generating curve lies always within the right circular cone 
whose vertex is w and whose generators make an angle a with the z-plane, then 
there exists an integer m such that 


k 
(50) | whenn>m, 


where k = 4-2 cot ia. 

If when t < t’ the generating curve lies always within a sphere of radius a 
tangent to the z-plane at the point w, then there exists an integer m such that 
(51) | 
wherek = Wa. 

In the first, since the curve passes through 2(p,/q,) that sphere must 
intersect the cone in question. The inequality is a direct result of this require- 
ment. It evidently holds after the convergent p~m/qm corresponding to the 
point whose parameter is t’. The inequality (51) expresses the fact that 
> (pn/qn) intersects the sphere of the theorem. 

The first theorem is applicable to any generating curve possessing at the 
point w a tangent not lying in the z-plane, the second to those curves having 
simple contact with the z-plane at that point. The inequality satisfied by 
the fractions of Hermite is deduced at once from (50) in putting a = 7/2. 
If w = p/q, a rational fraction, the fraction always terminates in the first 
case, and terminates in the second case if a = 1/2qq, for then the generating 
curve after the point ¢’ lies always within pentahedra with peaks at p/q. 

THEOREM. If the generating curve possesses contact of order \ with the z-plane 


< Tal whenn>m, 


qn|’ 
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at the point w, there exists a constant k such that all convergents satisfy the in- 
equality 
Pn k 

Representing for brevity the point on the curve by z(t), ¢(t), let a(¢) be 
defined by the equation 

f(t) =a(t)|z(t) — |". 
Then Lim a(t) = 0, or a + 0, or ~, according as y is less than, or equal to, 

or greater than} +1. From this it follows that if a; < a, after a certain 
point the generating curve lies entirely within the surface of revolution 


After a certain convergent > ( p,/q,) must intersect this surface. The surface 
is tangent to the z-plane at w, and as convergents are taken closer and closer 
to w the surface above them approaches more nearly a plane in form. If we 
replace a; by a still smaller value a2, the ratio of the ¢-coérdinates of the 
two surfaces being [2/f; = a2/a, < 1, we see that after a certain point in 
the series of convergents the highest point of the sphere 2(pn/qn), viz., 
z = Pn/qn, € = V2/qn Gn, lies always within the latter surface. From this 


v2 Pn 
Gn In Qn 

This is an inequality of the form (52), determining a value of k. We can 
increase k sufficiently to make the inequality hold for the finite number of 
convergents at the beginning of the series for which the inequality just found 
does not hold, and the theorem is established. 

It is easy to show that if the curve possesses contact of order \ the con- 
vergents do not all satisfy an inequality of the form 


2 


where u> M41 


however great k may be.* 

14. Formule for the calculation of the continued fraction. The determina- 
tion of the convergents defined by a given generating curve will be a very 
complicated procedure. This we found to be the case (Section 5) when the 
a By the use of a generating curve whose contact exceeds any finite order, such as 

z(t)=w+t, F(t) 


we can establish the existence of convergent continued fractions whose convergents satisfy no 
inequality of this form, however large k and however small » ( > 0) be chosen. 


f 
; 
Dn k 
Qn 1Qn 
ad 
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generating curve is a straight line. We propose to give formule for a con- 
tinued fraction analogous to the “ modified hermitian suite’ of Section 6. 
For this purpose we take as the convergent following p/q the z-coérdinate of 
the peak of the pentahedron in which lies the last intersection of the curve with 
~(p/q). Since the bases of the pentahedra with peaks at p/q iie very close 
to &(p/q) this process will give in most cases in which simple generating 
curves are employed the convergent immediately following p/g in the original 
fraction. In any case we shall thus set up a continued fraction differing from 
the original by the omission of certain convergents, and only a finite number 
of successive convergents will be omitted.* We note that in this modified 
continued fraction each convergent appears but once. 

A consideration of the curve of rectilinear segments treated in Section 11 
shows that if each convergent appears only once the last intersection of the 
curve with 2 (pn/qn) lies within S (pn41/dn41) and thus gives the correct 
convergent following p,n/qn. That is, any regular continued fraction, no two 
of whose convergents are equal, 1s the modified continued fraction of some generating 
curve. 

Let the generating curve have the equations (48). The first convergent 
will be 1/0; the second will be the complex integer nearest to the z-coérdinate 
of the last intersection of the curve with the plane 2(1/0), or ¢ = 142. 
As the parameter ¢ is supposed decreasing as we proceed along the curve, the 
parameter of this point of intersection is the algebraically smallest root, t’, 
of f(t) = 142. The second convergent is 8, the nearest integer to ¢; (t’) 
+ ide (t’). Wetake then pp = 1,q0 = 0, pi = 80, 1 = 1, whence pi Go — Pou 
=-l. 

Now let Pn-1/Gn-1, Pn/Gn be successive convergents, where 


We propose to find pnsi1/dn41. The sphere 2 (p,/qn) has the equation, from 


(17), 


Replacing z and ¢ by their parametric values, we find that the parameter ¢, 
of the last intersection of the curve with this sphere is the smallest root of the 
equation 
(54 [$1 (t)? + + f(t)? qn dn — (t) + Dn Qn 

[ (t) = ide (t)] Pn Qn (t) + Pn Pn = (). 


= Except in the case of a fraction having p/q ( = w) appearing infinitely often, which 
may be rendered terminating. 
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On making the transformation 2’ = (qn-12 — Pn-1)/(Qn2 — Pn), the sphere 
= (px»/qn) becomes the plane ¢ = 3 V2, and the point whose parameter is tp 
becomes a point in that plane whose z-coérdinate is (employing the second 
of equations (5)) 


[ (tn)? + (tn)? +H (tn)? In — [G1 (tn) + tbe (tn) Gin 
— [G1 (tn) — tho (tn) Pn 1 Qn + Pn-1 Pn 
[ bi (tn)? + (tn)? +f (tn)? Qn Qn — [G1 (tn) + the (tn) in 
[ bi (tn) tbe (tn) Pn Yn + Da Pn 


By performing a division and using the fact that ¢, is a root of (54) this can be 
written in the form 7 
(55) 
qn V2qnf (tn) 


The peak of the pentahedron in which this point in the plane ¢ = 3 V2 lies 
is at s,, the nearest integer to (55). Transforming back by the inverse of 
the above transformation, z’ = (paz — Pn-1)/(Qn2 — Gn—1), We can write 


Pritt = — Pn-r; Qnti = Yn Bn — whence 
(56) 
Putin — Po = — 1. 

We have then in brief the following method of procedure. 

The first two convergents are po/qo, Pi/qi where po = 1, Go = 0, ~i = 80, 
qi: = 1, in which so is the nearest integer to o,(t') + ide (t’) where t’ is the 
smallest root of f(t) = } V2 in the interval , St St. 

Beginning with these two convergents, the following convergents are calculated 
by means of the recurrence formule (56) where, after t, has been found as the 
smallest root of (54) in the interval, s, is the complex integer nearest to the 
quantity (55). 

The continued fraction with these convergents is then 


15. An example. The preceding process will involve considerable labor 
unless the generating curve is a simple one, but it furnishes a method of setting 
up continued fractions with desired properties. If we choose a straight line 
or circle as generating curve there are only two intersections of the curve with 
> (pn/qn) and the determination of t, involves only the solution of a quadratic 
equation. 

We shall now examine a case in which the generating curve is a circle tangent 
to the z-plane, 


(57) z=wrt+ 


V2t 


i 
| 
ig 
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This is of radius } V2 and lies in a plane parallel to 7 = 0. The interval 
for tis such that half of the circle, from its highest point to the point of tangency 
, is considered. 
” We find that 8 is the complex integer nearest to w +42. The equation 
(54) for the determination of ft, is 


(0 + Gn — + Gn + En dn) + = 0, 
in which 0, = dn — The roots of this are 
— V2 Gn + V2 Gn + qn)? — 40n Gn (Un in + Gn — 2) 
2 tn + 2Gn Gn — 2) 


The denominator of this expression is easily seen to be positive, hence the 
smallest root is that with the negative sign of the radical. 

Putting this value of ¢, into (55) and simplifying we find that s, is the nearest 
complex integer to the quantity 


Qn-1 — Un Un 


— — Vu, ti, — — 


where Un = 1/(qnw — pn), and iB, is the.imaginary part of qn Un. 
If w is real p,, qn, and w, will be real and (58) takes the particularly simple 
form 
— Un 


(58) 


+ — 3. 


Following are two examples of this development: 


1+i 1| 1| 1| 1| 1| 1| 


The radius of the generating circle, } v2, is sufficiently great so that if w=p/q, 
a rational, the generating circle is not contained within >(p/q); hence no 
pentahedra with peaks at p/qg are intersected. This development, therefore, 
furnishes always non-terminating continued fractions. 
An application of the inequality (61) shows that the convergents always 
satisfy the inequality 
| Pa| 


This can be derived also from (58), for, since the expression under the radical 
must be positive, wu, i, = 4, which is the inequality just found. 
The periodicity of these continued fractions will be discussed in a later 
section. 
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16. Periodic regular continued fractions. The continued fraction (45) is 
called periodic if integers m and k exist such that 8,4; = 8, whenever n > m. 
We shall write the periodic fraction in the form 


where the portion between the stars is repeated again and again. 
Let us suppose that the fraction converges and that w is its sum. Let wo 
represent the convergent fraction 


(60) 
Then 


(61) @= 


1| 1 1 m- — Pm . 
— — Wo — Ym 


where Pm/Gm, Pm+1/Qm+1 are convergents as previously defined of (59). Also 


| = | — @o 


(62) Wo = 8m4+1 — aad V (wo) ’ 
where are convergents of (60). 
Then, employing the usual notation, 


(63) w = U(w) = UV = UVU" (wa). 


That is, the transformation z’ = UV U~'(z), which is of the Group of Picard 
since U and V are, carries w into itself; w is a fixed point of the transforma- 
tion. If we write this transformation z’ = (az + 6)/(yz +6), equation 
(63) exhibits the well-known result that w is the root of a quadratic equation 


(64) y+ (6-—a)z—B=0, 


whose coefficients are complex integers. 

We shall now establish the following important 

THEOREM. The transformation = UVU-(z) transforms p,/qn into 
Pntk/Qn-k, Whenn > m. 

Conversely if integers m and k exist such that 


Qntk Qn 


where z' = T(z) is a transformation of the Group of Picard, the continued frac- 
tion is periodic. 
Now p,/qn is the value of the fraction when it terminates with s,1. It is 


| 

1| 1| 1| 
(59) 80 — 7 
{8m — |8m+1 — — 

* * 
1| 1| 

* | 8m+2 — | 
| 
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equal to the second member of (61) if wo be replaced by 


That is, pn/gn = U(A). Similarly 
1 1 1 
Qn+k — — Sntk—1 


Hence, Pnit/Qnex = UVU"(pr/qn). This reasoning does not require the 
convergence of the continued fraction. 
To establish the second part of the theorem, let 


T(z) = (az + B)/(yz+ 6), 


where the integers a, 8, y, 6 satisfy the equation ad — By = 1. Then, if 
n > m, we have 

Pntk _ &Pn + BQn 

Qnt-k  YPn + 


It is easily shown that both fractions are in their lowest terms. Hence 
Pntk = €n(QPn + BQn), = €n(¥Pn t+ +72. 
From this and the similar equations for pasx—-1, Yn4z-1, WE have 
Pntk — Pntk—-1 = €n €n—1( Pn Qn-1 — Pn-1 Qn) 


Since the first member, and also the bracketed part of the second member are 
each — 1, we have = 1. Likewise €.41€, = 1, ete., whence €m41, 
€m42, *** are either all real or all imaginary. 

Finally, 


= €n¢1(@Pnzi + = — + BC Gn — ] 


= €n+1 [ (apn + BQn) + = €n+1 | 


En+1 €n = Park “3 Patk—-1e 
€, En E€n-1 En 


€n-1 


Similarly 
Sn 
Qn+k+1 = Qntk — 
n 


Hence = 8n/e.. If en = + 1 always, then s,,, = if = + i always, 
then 8,4: = — $n. In the latter case we have however Sn42% = — Snik = Sn- 
We thus have periodicity in all cases. 
17. The character of the transformation 7 and the resulting properties of 
the continued fraction. A more intimate knowledge of the periodic fraction 
Trans. Am, Math. Soc, 3 
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necessitates a few remarks on the different types of linear transformations. 
Consider a transformation 2’ = (az + 8)/(vyz +6) where ad — By = 1. 
If a +6 is not real the transformation is called loxodromic. The trans- 
formations of which a + 6 is real are of three kinds: elliptic if |a + 6| < 2, 
hyperbolic if |a + 5| > 2, and parabolic if |a + 6| = 2. 

The fixed points of the transformation are the roots of equation (64). There 
is but one fixed point in the case of the parabolic transformation, in all others 
there are two. We shall treat first those transformations with two fixed 
points, 7; and rz. Let C be the semicircle orthogonal to the z-plane through 
the two fixed points and lying in the upper half-space. 

If the transformation is elliptic it is equivalent to successive inversions in 
two spheres passing through C.* If we call the diedral angle between these 
spheres 6/2 it is necessary that @ be a submultiple of 27 in order that the 
group be discontinuous. The transform of any sphere through C is a sphere 
through C making an angle @ with the original. After a finite number of 
applications of the transformation all points of space are brought back to 
their original positions. 

If 7 is elliptic there exists an integer v such that TY = 1. Then 


Pn/ dn 


when n>m. The convergents Dm+ky/Qm+ky appear in- 
finitely many times, and the continued fraction does not converge. We shall 
lay this case aside. 

If 7 is hyperbolic it is equivalent to successive inversions in two spheres 
with centers in the z-plane and cutting C orthogonally. Each circle C’ through 
7, and 72 is transformed into itself. By repetitions of 7 the transforms of all 
points of space (one fixed point excepted) approach one of the fixed points 
(= w, the sum of the fraction), all the transforms of a point lying on the 
circle C’ through that point. 

Let circles C,, C,, --+, C,, be drawn through the fixed points in such a 
manner that intersects Now 2 (pn/qn),n > m, is merely 
the transform by applications of T of one of these k =-spheres, hence the k 
circles intersect all 2-spheres after 2 (pPm/qm). These circles make angles 
greater than zero with the z-plane, and we see readily that we can construct a 
right circular cone with vertex at z = w and with axis perpendicular to the 
z-plane which intersects all the =-spheres of the convergents. From this we 
conclude by the reasoning of the first theorem of Section 13 that there exists a 
constant k such that all the convergents satisfy the inequality 


Gn 


* For the facts mentioned in this section see Poincaré, loc. cit. 


(65) 


4 
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If 7 is loxodromic it is equivalent to the successive performance of two 
transformations with the fixed points 7; and 72, one elliptic, the other hyper- 
bolic, but the angle 6 of the former need not be a submultiple of 27. C is the 
only fixed circle of the transformation, but we can construct a fixed surface. 
Let H be the circle of intersection of two spheres orthogonal to C. Then 
H is a fixed circle for the elliptic component of 7. If we form a surface T by 
constructing circles through 7; and 72 and the points of H this surface is 
invariant under the transformation 7’, for the elliptic component of 7’ trans- 
forms one of these circles through a point of H into another such circle, and 
the hyperbolic component leaves the latter circle invariant. 

Surfaces T,, ---, I’, can be constructed to intersect ( *** > 
(Pm+k/Gm+k); and (pn/grn), n > m, will always intersect one of these 
surfaces. Now H,, ---, Hy by means of which these surfaces are constructed 
will lie entirely in the upper half-space, and the I’-surfaces will in the neighbor- 
hood of w all lie within a suitable cone with vertex at w and with axis per- 
pendicular to the z-plane. This cone will intersect all =-spheres after a certain 
point in the suite of convergents, and as before it can be enlarged to intersect 
all 2-spheres of the suite. Therefore, a constant k exists such that all the con- 
vergents satisfy the inequality (65). 

The parabolic transformation presents a very different aspect. In this 
case there is a single fixed point 7. Any circle through 7 and tangent to a 
certain line in the z-plane is transformed into itself. The transformation is 
equivalent to successive inversions in two spheres through 7 orthogonal to 
the line just mentioned (and hence tangent to one another). 

Let Ci, ---, C; be members of the family of fixed circles just mentioned 
such that intersects = Then (pn/qn), n > m, intersects 
one of these circles. A sphere tangent to the z-plane at z = 7 = w and en- 
closing all these circles can be constructed, and its radius can be enlarged 
sufficiently to intersect also the 2-spheres preceding 2 ( pm41/qm4i). From 
the reasoning of the second theorem of Section 13 a constant k exists. such that 
the convergents all satisfy the inequality 


(66) 


It is easily shown that they do not all satisfy an inequality of the form (65) 
however large k be chosen. 

The preceding treatment covers all possible cases. If we add the remark 
that the fixed points of the hyperbolic and loxodromic transformations are 
always irrational, whereas the fixed point of the parabolic transformation is 
always rational, we can state the following 

THEOREM. The convergents of a convergent periodic regular continued fraction 
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always satisfy an inequality of the form (66). The sum of the fraction is irrational 
or rational according as an inequality of the form (65) is or is not satisfied. 

18. On quadratic irrationals. It is known from the theory of quadratic 
forms that there are infinitely many transformations of the Group of Picard 
leaving invariant the roots of an equation 


(67) A? + Bz +C =0, 


in which A, B, C are complex integers and the discriminant, D = B* — 4AC, 
is not a perfect square.* The roots are irrational. If 7 is one such trans- 
formation all powers of 7 also leave the roots invariant. Since there are 
only a finite number of elliptic transformations possible, it follows that 7 can 
be chosen to be either hyperbolic or loxodromic. 

For a better understanding of what follows we shall prove the following 

THEOREM. The necessary and sufficient condition that T can be chosen to be 
hyperbolic is that DD be a perfect square. 

The roots of the given equation being the fixed points of 7’, 


z’ = (az + B)/(yz +8), 
they are the roots of the equation 


If A, B, C are without a common factor, as we can suppose without loss of 
generality, we have 


y = uA, 6—a=uB, B= -1C, 
where wu is a complex integer. If we put a + 6 = t, whence 
6 = (¢+ uB)/2, a = (t—uB)/2, 
the relation ad — By = 1 yields the well-known equation of Pell 
— Du? = 4. 


Conversely any integral solutions of Pell’s equation which yield integral 
values of a and 6 provide a transformation leaving the roots of the equation 


unchanged. 
Let 7 be hyperbolic. Thentis real, and Du*isreal. Writing D = m + ni, 
u = a-+ bi, and equating to zero the imaginary part of Du*, we have 


(a? — b?)n + 2bm = 0. 


If either m or n is zero, obviously DD is a perfect square. Otherwise we can 


* Fricke-Klein, Theorie der automorphen Functionen, vol. 1, pp. 464-467. 
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write this in the form 
(an + bm)? = + n?) = DD. 


We can suppose b + 0, for b = 0 requires a = 0, and we have t = +2, 
u = 0, yielding the transformation 2’ = z. The equation then shows that 
DD is a perfect square, and the condition is necessary. 

To prove it sufficient, we suppose DD = G?, where G( > 0) is an integer. 
The circle in the z-plane with the line joining the roots of (67) as a diameter 
has its center at the point — B/2A and its radius is | VD/2A|, or VG/4A4. 


Its equation is 
* 


The sphere, R, with this circle as equator is the representative sphere of the 
indefinite hermitian form 


+ By) (24z + BG) — 


From the theory of these forms it is known that among the portions of this 
representative sphere lying in the different pentahedra a finite number can be 
chosen from which all others can be derived by suitable applications of trans- 
formations of the Group of Picard.* 

Let U be a transformation of the group leaving the roots of (67) unaltered. 
The circle C , orthogonal to the z-plane through the roots, is a fixed circle of U. 
Also C lies on the surface of R. Let II be a pentahedron through which C 
passes (we use here the single, not the double pentahedron), and let II,, Iz, 
--+ be the transforms of II by U, U?, ---. Let B, By, Bz, --- be the por- 
tions of the surface of R lying in II, II,, Iz, --+. Since only a finite number 
of these are non-congruent (i. e., not derivable from one another by a trans- 
formation of the group), we can choose two congruent ones B,, and B,. The 
transformation carrying B,, into B, is that carrying II,, into II,, that is U"™. 
If @ is the angle of the elliptic component of U"™” any sphere through C is 
transformed into another making an angle @ with it. Such a sphere is R, 
which is transformed into itself; hence @= 0 or z. If 6=0, U™™ has no 
elliptic component and is therefore hyperbolic; if @ = 7, U?—™ is hyperbolic. 
The transformation 7 can be chosen to be this hyperbolic transformation, 
and the condition is thus sufficient. 

If DD is a perfect square we shall call a root of the equation a hyperbolic 
irrational, otherwise we shall call it a lorodromie irrational. 

19. Periodicity when w is a hyperbolic irrational. We propose now to estab- 
lish certain general theorems concerning periodicity when the continued 


* Fricke-Klein, loc. cit., p. 467. 


37 
¥ 
a 
e 


38 L. R. FORD [January 


fraction is defined by a generating curve as described in Section 11. We shall 
treat only curves having a continuously turning tangent in the neighborhood 
of the point w. For periodicity, if w is a quadratic irrational, this tangent 
cannot lie in the z-plane. Let w’ be the other root of the quadratic equation 
of the form (67) satisfied by w, and let 7 be a hyperbolic transformation with 
w, w’ as fixed points and let all points be carried to the neighborhood of w 
by repeated applications of 7’. 

Let C’ be a circle through w, w’ and tangent to the generating curve at w. 
Then C’ is a fixed circle of 7, and an are A of C can be found such that all 
subsequent parts of the circle in the upper half-space are derived from this 
arc by applications of 7’, T?, ete. Hence the subsequent pentahedra through 
which C’ passes are derived from those through which A passes by these 
transformations, and the same is true of their peaks. 

Now suppose that A does not intersect an edge adjoining a base and is 
not tangent to a base. Then a small circle K can be drawn with a point on 
A as center and lying in a plane perpendicular to the are at this point such 
that the surface formed by circles through w, w’ and the points of K intersects 
always the same bases and in the same order as A. This surface being also 
invariant under 7’ intersects throughout its whole length the same bases and 
in the same order as C’. In the infinitesimal neighborhood of w this surface 
is a cone surrounding the tangent to the generating curve. After a certain 
point the generating curve lies entirely within this surface, and intersects the 
same pentahedra as C’. The 2-codrdinates of the peaks then satisfy the rela- 
tion Pnit/dnek = T(pn/qn) for a suitable value of k, and the fraction is 
periodic. Hence, the following 

TueoreM. [If at the hyperbolic irrational w the circle through w, w' tangent 
to the generating curve does not intersect an edge bounding a base of some penta- 
hedron and is not tangent to a base the continued fraction is periodic. 

If C’ intersects an edge adjoining a base the generating curve may in the 
neighborhood of w intersect the transforms of any of the bases adjoining this 
edge. If however a surface be formed by circles through w, w’ and the edge 
this surface will divide the cone at w into two parts, and if the generating curve 
lies after a certain point always within one of these compartments it will inter- 
sect always the transforms of the same base and the fraction will be periodic. 
This will always occur if the generating curve is algebraic unless it lies entirely 
upon the algebraic surface just formed, for it can intersect that surface only a 
finite number of times. The case in which C’ is tangent to a base can be treated 
in a similar manner. 

The preceding theorem requires alteration for the modified continued 
fraction for which formule were given in Section 14. Here the convergent 
following p/q does not depend upon the intersection of the curve with a base 
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but with the sphere = (p/q). The points of this spherical surface within the 
circular quadrilateral formed by the intersection of the four lateral faces of a 
pentahedron with peak at p/g all lie within pentahedra having the same peak. 
If we replace the condition of the theorem by the condition that C’ shall 
“not intersect the circle in which the lateral face of some pentahedron inter- 
sects the sphere of its vertices” the theorem holds. As before this condition 
can in general be dispensed with if the generating curve is algebraic. 

20. Periodicity when w is a loxodromic irrational. If 7 is loxodromic the 
only fixed circle is the circle C through w, w’ orthogonal to the z-plane. An 
are A of C can be found such that the part of the circle between A and w is 
derived from A by the transformations 7’, 7”, etc. Suppose that A intersects 
no edge bounding a base. [Since C is orthogonal to the z-plane it cannot be 
tangent to a base.] About a point of A let a small circle H, as in Section 17, 
be constructed—that is, the circle of intersection of two spheres orthogonal 
to C. The surface T formed by circles through w, w’ and the points of H is 
invariant under 7’. Further, if H be sufficiently small this surface intersects 
the same bases and in the same order as A, and hence the same bases as C 
throughout its entire course. Now let H be enlarged until T intersects an 
edge or becomes tangent toa base. Call this surface 9. In the infinitesimal 
neighborhood of w, Ip is a right circular cone with the line z = was axis. We 
have the 

THEOREM. If w is a loxodromic irrational and the circle C through w, w’ 
orthogonal to the z-plane does not intersect an edge bounding a base, there exists a 
right circular cone To with vertex at w and axis perpendicular to the z-plane, such 
that if the tangent to the generating curve lies within 19 the fraction is periodic, 
and if it lies without the fraction 1s not periodic. 

To prove the latter part of the theorem let H be increased until the tangent 
to the generating curve is tangent to T at the point w. NowT will intersect 
certain bases not intersected by C. Let B be one of these bases. The trans- 
forms of B by 7, T?, --- approach w and owing to the elliptic component of T 
are rotated about the circle C through multiples of an angle incommensurable 
with 27. Infinitely many of these are intersected by the generating curve. 
But no transformation U exists such that each one intersected is derived from 
the preceding by an application of U. For all would then be derived from the 
first by U, U’, ---, and since the angle of the elliptic component of U must be 
incommensurable with 27 certain of these bases are on the opposite side of C 
from the generating curve and are not intersected by it. The fraction cannot 
then be periodic. 

The theorem relative to the modified continued fraction is stated in the same 
terms as before—by substituting “the circle in which a lateral face of a 
pentahedron intersects the sphere of its vertices’ for “an edge bounding a 
base.”’ The surface Tp is in general different. 
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Periodicity of the continued fraction derived from the suite of Hermite. The 
preceding theorems establish the facts concerning the continued fraction 
whose convergents are the fractions of Hermite. The generating curve is 
algebraic and even when C’ intersects an edge does not lie on the exceptional 
surface mentioned in Section 19. Furthermore it is perpendicular to the 
z-plane and lies always inside the cone Ip of Section 20 when that cone exists. 
Hence the 

THEOREM. The continued fraction whose convergents are the fractions of 
Hermite is always periodic when w is a hyperbolic irrational. It is periodic 
when w is a loxodromie irrational except when the circle through w, w’ orthogonal 
to the z-plane intersects an edge adjoining a base of some pentahedron. 

21. Periodicity when w is rational. If w = p/q, a rational, we know that 
the fraction terminates unless the tangent to the curve at w lies in the z-plane, 
and that it is not periodic if the curve has contact with the z-plane of higher 
order than the first. We shall treat then curves of simple contact. Let C 
be the osculating circle of the curve at w. Then C is tangent to the z-plane, 
but does not lie in that plane. Obviously if C lies entirely within the penta- 
hedra with peaks at p/q the same will be true of the generating curve after a 
certain point and the fraction will terminate. 

Under what circumstances is C a fixed circle for some parabolic trans- 
formation with w as fixed point? These transformations are those carrying 
some pentahedron with peak at p/q into some other. If we make the trans- 
formation 2’ = (qoz — po)/(qz — p), where pgo — pog = — 1, the penta- 
hedra with peaks at p/q become those with peaks at o. The circle C becomes 
a straight line L parallel to the z-plane. The parabolic transformations carry- 
ing the pentahedra with peaks at © into themselves are 2’ = z + 6 where B 
is a complex integer. The fixed lines are those parallel to the line joining the 
origin to the point z = 8; that is, parallel to Bz — Bz = 0, or if we like parallel 
to the line 


B(z — qo/q) — B(2 — qo/q) = 0 


through qo/q, which is parallel to the preceding. The fixed lines for the 
transformation z’ = z + 6 meet this line at «©. Transforming back (putting 
(qoz — po)/(qz — p) for z in this equation) we get the line 


Bg? (z — p/q) — BF (2 — p/q) = 0. 


The parallel lines previously mentioned become circles through p/q tangent 
to this line. Giving 8 all integral values we get the lines to one of which C 
must be tangent in order to be a fixed circle of some parabolic transformation 
with p/q as fixed point. 

The preceding line is not less general on account of the factor q’. Putting 
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B= B’ g’, an integer, we have 
B’ (z — p/q) — — B/G) = 9. 


That is, the tangents to the fixed circles at p/q are parallel to the lines joining 
the origin to all complex integers 8’. Otherwise put, it is necessary and suf- 
ficient that the slope, dn/dé, of the tangent to the circle be rational. 

Let the slope dn/dé of the tangent to the generating curve be rational. Then 
C is a fixed circle of some transformation 7, and we see just as in the pre- 
ceding cases that the peaks of the pentahedra through which C passes fall 
into a recurrence scheme Pn+x/Qnik = I'(pn/gn). If C intersects no edge 
bounding a base and is tangent to no base a circle K can be constructed with a 
point on C as center and lying in a plane perpendicular to C, such that the 
circles tangent to C at the point w and passing through the points of K form a 
surface which intersects the same bases and in the same order as does C. 
These circles possess simple contact with C whereas the generating curve has 
contact of a higher order, hence after a certain point the generating curve lies 
entirely within this surface. Its convergents then satisfy the relation stated 
above, and the fraction is thus periodic. 

If C intersects an edge or is tangent to a base we cannot infer periodicity. 
If however the generating curve is algebraic we have, as in the hyperbolic 
case, periodic fractions in general in this case also. 

If the tangent to the generating curve has not a rational slope the fraction 
is not periodic, for the convergents can not satisfy after a certain point a 
relation of the form Pnix/qn4z = I'(pn/qn). This is easiest seen by con- 
sidering the line Z and the transformations 2’ = z + 8. If the slope of L is 
not rational the repeated application of a transformation of this form will 
carry any pentahedron which L intersects to a distance as great as we like 
from L. 

We have established then the following 

THEOREM. [If the generating curve has simple contact with the z-plane at the 
rational point w, the fraction terminates if the osculating circle C of the curve at 
the point w lies within pentahedra with peaks at w. It is not periodic if the slope, 
dn/dé , of the tangent to the generating curve is irrational. 

If the slope is rational the fraction is periodic unless C intersects an edge 
adjoining a base of some pentahedron or is tangent to some base. 

A similar theorem holds for the modified continued fraction with the change 
of phraseology previously mentioned. 

The periodicity of the continued fraction of Section 15. Here the osculating 
circle is the generating curve itself. Its radius, } V2, is sufficiently great to 
prevent the termination of the fraction. The circle lies in a plane parallel to 
n = 0, hence dn/dt = 0, a rational. The fraction is periodic unless the 
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circle meets the circle in which the lateral face of some pentahedron inter- 
sects the sphere of its vertices. This exception is a trivial one. For the 
generating circle likewise intersects the successive transforms of this circle 
on the sphere of vertices. We have thus at regular intervals in the formation 
of the fraction two possible values of the next convergent at our disposal. 
If the choice be properly made each time the fraction will be periodic. With 
this understanding for the exceptional case, the continued fraction is periodic 
for all rational values of w. 

22. The number field [1,¢]. All the results of this paper can, with suit- 
able modifications, be extended to the number field in which an integer is 
defined to be a number of the form m + ne, where m and n are real integers 
and e = } + 3iv3. Bianchi* has found the fundamental polyhedron of the 
group 2’ = (az + B)/(yz + 6), ad — By = 1, where a, B, x, 6 are integers 
of this field. Three of the polyhedra into which the half-space is divided 
form the heptahedron whose lateral faces are the six planes § = +3, +7 v3 
= +1, &—~7vV3 = +1, and whose base is the unit sphere 22+ = 1. 
This heptahedron plays the réle of the fundamental double pentahedron of 
the preceding pages. 

THe UNIvERsITy OF EDINBURGH, 
April, 1917 
 *Mat hematische Annalen, vol. 38 (1891), pp. 313-333. 
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ON THE IMAGINARY ROOTS OF A POLYNOMIAL AND THE REAL 
ROOTS OF ITS DERIVATIVE* 


BY 
HENRY BEDINGER MITCHELL 


1. Introduction. If a polynomial with real coefficients is found to have 
bend points exterior to the interval within which its real roots must lie, or 
if its derivative vanishes more than once in an interval between two consec- 
tive real roots, the existence of imaginary roots is immediately evident. It 
would seem probable that such marked features of the graph, as thus instantly 
reveal the presence of imaginary roots, should also set limits, in the complex 
plane, to a region within which one or more of these roots must be contained; 
and the question arises as to what we can infer, without laborious calculation, 
as to their position. In the theorems which follow, we shall seek a partial 
answer to this question, and shall therefore be concerned, not with the problem 
of obtaining the closest possible approximations, but rather with the inferences 
that can be drawn from certain minima data, that are to include the existence 
of a bend point, where the curve is not concave toward the X-axis, and its 
position relative to the adjacent real root or roots. 

2. Notation. Let 


(1) = + + Apia + An 


be a polynomial having real coefficients, and let 


(2) = (4 — 11) — — tm) Sm), 
(3) =[(a-—a,)?+ bi] (2k =n—m) 


be the products of the factors corresponding, respectively, to its real and 
imaginary roots, so that 


f(x) = Aog(x)- 
We shall then denote by L(x) and A(x), respectively, the logarithmic 
derivatives of d(x) and y (2), so that 

f' (2) 
(4) F(a) 


* Presented to the Society, October 28, 1916. 
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= L(x) +X(z), 
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m 


L(z) 


— 


2(2 —a;) 

Ma) 

3. TheoremI. If f(x) be a polynomial having real coefficients and at least 
one real root, and if f’ have a real root p exterior to the interval 1} = 2% Stn, 
between the least and the greatest real root of f , then there exists at least one pair 
of imaginary roots of f, a+ %ib, whose real and imaginary parts satisfy the 
inequalities 

(7) 


or 


0<b=h(p — tm) (when p > rm), 


(8) 
0<b=h(n — p) (when p < 7), 
where 2h is the number of imaginary roots whose real part is greater than p in 
the first case, and less than p in the second case. 
That f has imaginary roots is an immediate consequence of Rolle’s Theorem. 
As p is a root of f’ that is not a root of f, we may write, from (4), 


L(p)+X(p) = 9. 
When p > Tr» this gives 


The separate terms of the last sum are positive or negative according as a; => p, 
and as the whole sum is positive at least one of its terms must be so also. 
Denoting by h = k the number of such positive terms, and by a + ib the 
pair of imaginary roots corresponding to a term that is at least as great as 
any other, we have 


1 a-—p 2h 
9 0 <—— =2h 


whence 
0<a—p<2h(p—tm). 


To obtain the inequality for b we have only to note that 


2(a—2) 
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has the maximum value of 1/6, so that 


and therefore, by (9), 
0<b=h(p—tn), 


which completes the proof for the case where p > rm,-and a precisely similar 
procedure establishes the inequalities (8) for p < r1. 

CoroLttary. As 1 Sh =k, where 2k is the number of imaginary roots of f , 
we may substitute k for h in (7) and (8), which gives the form in which the the- 
orem must be used when we have no indication of how the imaginary roots of f are 
separated. 

4. Theorem II. If f(x) be a polynomial having real coefficients and at 
least one real root, and if f’ have a real root p exterior to the interval r) = x S rm 
between the least and the greatest real root of f , then f’ has an even number of such 
roots, without and on the same side of the interval, and there exists at least one 
pair of imaginary roots of f, a +18, whose real and imaginary parts satisfy 
the inequalities 

|p2 — a| < B S (p2 — Tm) V2k, 
(10) vi 
| po a|= = Tm ) 


where pz > Tm is the greatest real root of f’ , or 
— a| BS (m1 — pz) 


vk 
|pp — alS(n—- p2) 


where po < 1; is the least real root of f’ , and 2k is the number of imaginary roots 
of f. 

It will be sufficient, as before, to establish the first set of inequalities, the 
proof for the second case being similar save for the change of sign. From (4) 
we have 


For values of x greater than both r,, and the greatest a in A, both L and X 
are positive. For sufficiently small values of the positive quantity 6, Z +X 
is also positive when x = r,, + 6 and is finite and continuous for all greater 
values of x. Therefore if f’, and so L + X, have one real root greater than rp, 
it must have an even number of such roots, and at the greatest of these the 
slope of Z + d cannot be negative. 
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We have, therefore, 


L'(p2) +’ (p2) 20, 

+ (p2 — ai)? P 

As the last sum is positive, at least one of its terms must be greater than zero. 
Denoting by a + i@ the pair of imaginary roots corresponding to a positive 
term that is at least as great as any other, we may write 


B? > (p2 — 


1 

(p2 — Tm)” + (p2 — a)?] 

If we regard the last expression as a function of 8, its maximum value is 

found to be 


whence 


2 


p2 — )” 


and 


0 


I: 
4(p2 — a)? 


and we observe that it is also less than, or at most equal to, 24/8”, from 
which we obtain 
(p2 — Tm)? ~ 4(p2 — 
(p2 — tm)? ~ 6’ 
whence 


|p2 — a|= (p2 — Tm) 7. 


|p2 — al < BS (p2 — tm) V2k. 


5. Theorem III. Jf a polynomial, f(x), having real coefficients, have at 
least one real root and not more than two imaginary roots, and if f’ have a real 
root p greater than the greatest real root of f or less than its least real root, then f’ 
has two and only two such roots, p, and p2, both being greater than the greatest 
real root of f or both less than its least real root, and f has a pair of imaginary 
roots, a + tb, such that 


(12) if 
or 
(13) O<p-a<bSn-m tf 


so that, of the projections on the X-axis of the two exterior bend points, the one 
lies nearer to the real part of the imaginary roots of f than the other does to the 
nearest real root of f . 


+ 
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It will be sufficient to observe that as f has but a single pair of imaginary 
roots, the inequalities of Theorems I and II must alike apply to that pair 
forh=k=1. 

CoroLtuary. By Theorem II, we may also write for k = 1, 


3p2 — Tm 


(14) whens tm < pi 
3p2 — 


when = pi <1, 


(15) > a= 
so that, when f has but a single pair of imaginary roots, if the interval on the 
X-axis from the most remote exterior real root of f’ to the nearest real root of f be 
divided externally in the ratio of — 1:3, the real part of the imaginary roots 
of f will lie between the root of f’ and the point of division. 

6. Theorem IV. If f(x) be a polynomial having real coefficients and not 
more than eight imaginary roots, and if f’ reduce to zero more than once in an 
interval between two distinct, consecutive, real roots of f, rn <x < 1;, then there 
exists a pair of imaginary roots of f, a +8, such that a is included in the 
interval, and B is not greater than its extent: 

(16) and 

As f’, and so L + X, reduces to zero more than once in the interval, there 

must be an intermediate point x = c, for which L’(c) + ’(c) = 0, or 
bi — —a;)? 


= 2 
i=] [ + (c 


1 1 
at (ec — ("mi Tr)? 


As the first member of (17) is positive, one or more of the terms of the second 
member must be positive also. Denoting by a + 78 the pair of imaginary 
roots of f corresponding to a positive term that is at least as great as any 
other, we have, as in Theorem II, remembering that 2k = 8, 

9 bi — (ec — a;)* — (ce ay 1 
[bi + (¢ — ai)? P if + 4(e- af af” 
19) 

8 

—a)’P 
Combining (19) with (17) and (18), we have 


== 


2k 


4 
é=1 (e r; 
But 
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Thus 1/(c — a)? is greater than both 1/(c — r,)? and 1/(¢ — r;)?, and 
hence |¢ — a@| is less than |¢ — r,| and less, also, than |e — r;|. As ¢ lies 
within the interval from r, to r;, so also does a. 

CoroLitary. When f(a) has but two imaginary roots, then 


(20) B=-5 


7. Theorem V. If f(x) be a polynomial having real coefficients, and if f'’ 
vanish more than once in an interval r, < x < 1; between two distinct, consecutive 
real roots of f , the real parts of the imaginary roots of f cannot all lie without and 
on the same side of the interval, unless the number of real roots on the opposite 
side be greater than eight,* and unless the number of imaginary roots be also 
greater than eight, the degree of the polynomial being thus at least twenty. 

The necessary and sufficient condition that f’ should vanish more than 
once in the interval is that there should be a point c, r, < ¢ < r;, for which 


L(ce)+2A(e) = 0, L’'(e) +N (ce) 20. 


As L’(x) <0 for all values of x, we may rewrite these conditions in the 
form 


(21) A(c) = —L(e), (c) >0, 


from which it follows that there must be a point of intersection of the two 
curves y = A(x), y = — L(x), at which the former is rising, and where its 
subtangent is less than or at most equal to the subtangent of the latter in 
absolute value. 

The subtangent to \ at any point is 


2 
imi [65 + (2 — a;)?] 
As we require that every a is to be without and on the same side of the interval, 
the terms of the numerator are all of the same sign for x = c, and, denoting 
by a@ the a nearest to c, we may write 


. le-—a| 1 
bi + — a;)? 1 
° 
(22) > al 


 * The opposite end-point of the interval, r, or r;, is included in this enumeration. 


4 
k 
2> a; 
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Thus the subtangent to \ at the point c is greater in absolute value than 
the distance between c and the nearest real part of an imaginary root of f, 
and as all of these are to lie without the interval while ¢ lies within it, we must 
have 
X(e) 

(ec) 
But by (21) this requires that we should also have 


(23) if L(e)>O0, 


>a-—e>r—e, if sothat A(c) <0, 


if c>aZza, sothat A(c)>0. 


L(c) 


(ec) >e—rm, if <0. 


(24) 


We proceed, therefore, to determine the conditions under which these latter 

inequalities can subsist; i. e., in which the tangent drawn to y = L (2) at an 

interior point of the interval can intersect the X-axis outside of that interval. 
The intercept on the X-axis of the tangent to y = L(x), 


25) 


may, for a fixed value of x, be regarded as a function of the roots, r1, ---, Tm. 
Taking the partial derivative of X with respect to any one of them, we have 
ax L’ 2L 


As L’ < 0 for all values of z and r, the first term of the second member is 
always positive. When L > 0 and r >2, or when L <0 and r < z, the 
second term is also positive and (0X/dr) >0. Therefore for any point 
where L is positive, X will be increased by removing indefinitely to the right 
any real root of f that lies to the right of that point; and for any value of x 
for which L is negative, X will be made less if we remove indefinitely to the 
left any real root that lies to the left of x. But to remove a root indefinitely 
to the right or left is ultimately to cause the corresponding term in LZ and L’ 
to vanish. Therefore we obtain an upper and a lower limit for the intercept 
Trans. Am, Math Soc, 4 
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of the tangent drawn to y = L(x) at any point of the interval by disregarding, 
for the upper limit, all the real roots greater than r;, and, for the lower limit, 
all the real roots less than r;,. 

But when L > O andr < z, or when L < O andr > z, the second member 
is composed of two terms of opposite sign, and becomes zero when 


2L 


which gives us the maximum value of X in the first case, and the minimum 
in the second. Substituting this value for each r that is less than z in the 
expression (25) for X , and removing indefinitely to the right all-the real roots 
greater than r;, we have, as the greatest value which X can assume when L is 
positive, 

in 2L + 

hint 


where h denotes the number of real roots of f that lie to the left of z, including 
the root r, at the beginning of the interval. This gives us, 


L’ 2-7; 
4L 2-7, ~’ 
or, for x = ¢, 
h 1 L(c) 1 
(27) 4 ——( 
If now, as (23) requires, we are to have 
I(e) 


each of the terms of the second member of (27) must be greater than unity, 


and therefore 


(28) 


and the degree of @ must be at least 10. 

A precisely similar substitution shows that X cannot fall to the left of r;, 
unless the number of real roots to the right of the interval is greater than 8, 
and so, as before, the total number of real roots is at least 10. 


J 
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That the number of imaginary roots of f must be greater than 8 follows 
from Theorem IV, and the proof is complete. 

8. Applications and Examples. To illustrate the application of these 
theorems we may consider the following simple examples. 

Example 1. <A graph of the polynomial 


3a4 — 282° + 722? — 110 
shows bend points at x = 3 and x = 4, whereas its greatest real root lies 
between 1.9 and 2. By Theorem III, we then have for its imaginary roots, 
4<a<i.l, 0<b<1.1. 
Example 2. The equation 
x® — 5a* + 102* — 202” + 40x — 26 = 0 


has unity for its sole real root, as its derivative vanishes for x = 2, 2, and 
+iv2. Of the four imaginary roots, one pair must have its real part nega- 
tive; and for the other pair we may write by Theorems I and II, 


2<a< 2.71, 1}. 
Example 3. The equation 
482° — 852° + 15x + 22 = 0 


has unity for a double root, its remaining real root lying between — 1.3 and 
— 1.4. Its derivative vanishes for x = +1 and +}. By the corollary to 
Theorem IV, we have for the imaginary roots 


—14<a<'l, 0<b<1.2. 


When the roots of the polynomial itself are known, these theorems may 
often be useful in determining the nature of the roots of the derivative. 
Example 4. The equation 


f(z) = + 3P-[(x +3P =0 


has the real part of its imaginary roots outside of the interval between its 
least and greatest real roots. As the number of imaginary roots is not greater 
than eight, by Theorem IV the derivative can vanish only once within that 
interval. It cannot vanish on the boundary, as the real roots are not multiple. 
It cannot vanish outside the interval, for if p > were a real root of f’, by 
Theorem I we should have p < 1 and b = }, whereas, on the contrary, b = v3. 
Therefore the derivative can have but one real root; and by symmetry it is 
zero. 


4 
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Example 5. The equation 
f(z) = (2 +1 1)? +3] =0 


differs from that of Example 4 only in the multiplicity of its imaginary roots, 
but its derivative vanishes at least once between — } and 0, and at least once 
between 0 and + {, as well as at the origin. 

Example 6. The equation 


x(x — 1)" [(x + + = 0 


has all its imaginary roots without and on the same side of the interval between 
its real roots; but because of the high multiplicity both of the imaginary roots 
and of the real root on the opposite side of the interval, the derivative vanishes 
at least once between 0 and }, and at least once between } and 1, as well 


asatz =}. 
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RELATIONS ENTRE LES NOTIONS DE LIMITE ET DE DISTANCE* 
PAR 


MAURICE FRECHET 


POSITION DU PROBLEME 


1. Un certain nombre de propriétés des ensembles Jinéaires peuvent étre 
étendues aux ensembles abstraits, c’est 4 dire aux ensembles dont les éléments 
sont de nature quelconque ou inconnue. 

C’est ainsi que G. Cantor a pu développer la théorie des nombres cardinaux 
et ordinaux qui s’applique aussi bien aux ensembles abstraits qu’aux ensembles 
linéaires. C’est ainsi qu’on peut donner une définition de |’intégrale d’une 
fonction étendue 4 un ensemble abstrait en généralisant d’une facon presque 
immédiate une définition donnée par J. Radon pour |’intégrale étendue 4 un 
espace a un nombre fini de dimensionst comme je |’ai montré récemment.t 

2. Mais si l’on veut aller plus loin, on est amené a introduire la notion de 
limite ou méme la notion de distance tout en s’efforcant de les présenter sous 
une forme qui ne fasse pas intervenir la nature des éléments de la classe d’élé- 
ments considérée. 

En ce qui concerne la notion de limite, on peut se contenter d’admettre 
qu'il existe une régle d’ailleurs quelconque ou inconnue permettant de décider 
si une suite infinie d’éléments A,, Az, ---, An, -+-, de la classe considérée 
converge ou non vers un élément déterminé A. On se bornera a supposer 
que ce critére est tel que 1° si les éléments de la suite sont identiques, la suite 
converge et converge vers A;; 2° si une suite {A,} converge vers A, toute 
suite infinie extraite de la suite {A,} converge aussi vers A. 

De telles classes d’ensembles, que j’ai appelées classes (L), ne sont pas 
moins générales qu’une classe abstraite queleonque. Rien n’empéche en effet 
de considérer une classe abstraite arbitraire comme classe (LZ) en s’y prenant 
par exemple de la maniére suivante: on porte son attention sur une suite 
arbitraire A, Ai, Az, -*:, An, -*:. Une suite d’éléments distincts sera 
considérée comme convergente si c’est la suite A;, As, +--+, An, OU Si 
elle est extraite de cette suite, et dans les deux cas la suite est dite converger 
- * Presented to the Society, April, 1917. 

tJ. Radon, Wiener Sitzungsberichte, vol. 122, 2a (1913), pp. 1295-1438. 

tM. Fréchet, Bulletin de la Société Mathématique de France, 


vol. 43 (1915), pp. 248-265. 
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vers l’élément A. Dans le cas contraire, la suite est considérée comme 
divergente. 

Ceci étant et moyennant une extension convenable des définitions ordinaires, 
on peut développer une théorie de la connexion des ensembles abstraits dans 
les classes ot une telle définition de la limite est adoptée* ou encore généraliser 
quelques théorémes sur les fonctions également continues. t 

3. Mais si l’on veut généraliser des théorémes plus précis de la théorie des 
fonctions, une définition aussi indéterminée de la limite ne suffit plus. J’ai 
done été amené, dans ma Thése, a étudier l’effet de restrictions de plus en 
plus grandes apportées a l’idée de limite, ces restrictions restant toutefois 
telles que toutes les définitions classiques de la limite dans les ensembles les 
plus importants (ensembles de points, de courbes, de fonctions continues, etc.) 
y satisfont par avance. 

La notion qui intervient de la facon la plus essentielle pour faciliter l’étude 
de la limite dans la théorie des ensembles linéaires est la notion de distance. 
Réduite a ses caractéristiques abstraites, on peut la décrire ainsi: 

Dans une classe (L), on peut définir la distance si 4 tout couple A, B d’élé- 
ments de la classe correspond un nombre (A, B) = (B, A) = 0 jouissant 
des propriétés suivantes: 

1° (A, B) n’est nul que si A, B coincident; 

2° si, A restant fixe, la suite {B,} converge vers A, (A, B,) doit converger 

vers zéro et réciproquement; 

3° on a, quels que soient les éléments A, B, C de la classe, 


(B,C) = (A, B) + (A,C). 


J’appellerai un tel nombre la distance de A a B et je dirai d’une classe (LZ) 
ol on peut définir une distance que c’est une classe (D) .t 

Il faut tout d’abord remarquer que les conditions 1°, 2°, 3° a elles seules ne 
peuvent apporter aucune limitation & Varbitraire de la nature des éléments de la 
classe. Considérons en effet la catégorie la plus générale des classes d’éléments 
abstraits. “On peut d’abord définir au moins d’une fa¢on une nombre satis- 
faisant 4 1°, 3°—il suffit par exemple de prendre pour (A, B) la valeur 1 
quand A, B sont distincts, 0 quand ils coincident.§ Une fois choisie une 
définition de la distance satisfaisant 4 1° et 3°, si l’on définit précisément la 
limite par la condition 2°, on aura bien une classe (D). 

4. Le probléme qui se pose n’est donc pas de savoir a quelle condition une 
classe abstraite est une classe (D), mais 4 quelles conditions simples et 
" *M. Fréchet, Mathematische Annalen, vol. 68 (1910), pp. 145-168. 

+ M. Fréchet, These, Rendiconti del Circolo Matematico di Pa- 
lermo, vol. 22 (1906), pp. 1-74. 

t Je modifie donc ici et dans la suite les dénominations adoptées dans ma Thése. Ce que 


j’y ai appelé écart et voisinage je l’appellerai ici distance et écart uniformément régulier. 
§ Voir Note Supplémentaire A 4 la fin de cette memoire. 
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indépendantes une classe (L) ot les suites convergentes et leurs limites sont 
déja définies est-elle une classe (D) et comment on y peut calculer la distance 
(A, B). 

Ainsi, Vobjet de ce travail est le suivant: sachant qu’on peut toujours de bien 
des fagons définir dans une classe d’éléments abstraits les suites convergentes et 
leurs limites, il s’agit de déterminer & quelles conditions supplémentaires il faut 
assujettir ce choix pour que l'on puisse définir sur cette classe une distance telle 
que la convergence définie d’avance ne soit pas modifiée quand on la définit au moyen 
de cette distance. 


REDUCTION DU PROBLEME 


5. Pour résoudre ce probléme, nous porterons d’abord notre attention sur les 
conditions 1° et 2°. Si dans une classe (LZ), on parvient 4 définir un nombre 
(A, B) = (B, A) = 0 satisfaisant aux conditions 1° et 2°, on dira que cette 
classe est une classe (E), ou encore qu’on a pu y définir un écart,* a savoir le 
nombre (A, B), qui sera l’écart des éléments A et B. Si cet écart satisfait 
en méme temps 4 la condition 3°, ce sera en méme temps une “ distance.” 

D’autre part il est facile de vérifier que dans une classe (D) tout ensemble 
dérivé est fermé, c’est 4 dire que si une suite d’éléments {A,} converge vers 
Vélément A et si chaque élément A, est la limite d’une suite convergente 
A™, AM, «++, AM®, +++, on peut extraire de l’ensemble 4 double entrée 
des AS” une suite Af”, AS”), ---, qui converge vers A. Car 

(A, AM) (A, Am) + (Ans AM), 

et en prenant n; puis p; assez grand, on rend le second membre et par suite 


le premier aussi petit que l’on veut. 
Nous dirons d’une classe (LZ) ot tout ensemble dérivé est fermé que c’est 


une classe (S). 


6. Neus evens dés lors obtenu deux conditions nécessaires pour qu'une classe 
(L) admette une définition de la distance, c’est 4 dire soit une classe (D). 
Il faut que cette classe (LZ) soit une classe (£) et en méme temps une classe 
(S). 

Il est facile de voir que ces deux conditions sont indépendantes. 

Considérons en effet la classe formée des points d’un plan définis comme il 
suit par leurs coordonnées cartésiennes: 

pour le point Ay: x = 0, y = 0; 

pour un quelconque A, des points Ai, Az, An, = 1/n,y = 0; 

pour un quelconque A‘ des points de la suite Al’, AP, AY, «++, a 

double entrée: x = 1/n, y = 1/p. 

* J’introduis ici une notation (EZ) et un mot “ écart”’ que j’ai emvloyé dans ma Thése, 

14 ot j’emploie maintenant la notation (D) et le mot distance. 
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Nous pouvons définir ainsi un écart de deux points quelconques de la 
classe; il sera pris égal: 

a l’unité si l’un des points est Ag et l’autre l’un des points A”; 

a la distance géométrique des deux points sur le plan dans tout autre cas. 
Ceci étant nous pouvons considérer la classe comme une classe (L) et en 
méme temps comme une classe (EF) en y définissant la limite précisément par 
la condition 2° énoncée plus haut. 

Cependant elle ne peut étre une classe S, bien évidemment. 

Inversement une classe (S) peut ne pas étre une classe (EF). En effet 
reprenons la méme classe d’éléments, mais cette fois définissons autrement la 
convergence d’une suite. Une suite d’éléments de la classe sera dite con- 
vergente vers l’un d’eux lorsque leur distance géométrique 4 ce dernier tend 
vers zéro, 4 moins que la suite ne comprenne les deux points A”, A®, cas 
ou la suite sera dite divergente. II est évident que dans cette classe tout 
ensemble dérivé y est fermé. Pourtant, on ne peut y définir un écart sans 
altérer la convergence des suites. Car dans ce cas, si on considére la suite 
A,, Ao, +++, An, +++, par exemple, elle converge vers Ao, done si |’écart 
existait, la suite (Ao, A1), (Ao, Az), «++, (Ao, An), «++, devrait converger 
vers zéro. Par suite, il en serait de méme de la suite de nombres (Ao, A”), 
(Ao, A), (Ao, A1), (Ao, Az), +++, (Ao, An), +++, et alors la suite de 
points A®, Ae, An, «++, convergerait vers A» contrairement 
a l’hypothése. 

7. Nous remarquons immédiatement que les exemples précédents montrent 
en méme temps que s? l’on a défini d’avance la limite dans une classe abstraite, 
il n’est pas toujours possible d’y définir la distance de deux éléments quelconques 
de facon que la limite antérieurement définie puisse aussi étre définie sans 
altération au moyen de la distance. Nous voyons aussi qu’il n’est méme 
pas toujours possible d’y définir un écart sans altérer les limites. 

8. Nous pouvons en outre décomposer en plusieurs autres conditions indé- 
pendantes la condition pour une classe (L) d’étre une classe ( E). 

Si une classe est (E), la condition nécessaire et suffisante pour qu’une 
suite d’éléments { A,} converge vers |’élément A est que pour g donné on puisse 


trouver p tel que 
1 


4) 


pour n > p. Autrement dit si l’on appelle >, l’ensemble des éléments B 
tels que (4, B) < 1/q, il faut et il suffit que quel que soit q, la suite des A, 
soit contenue dans =, a partir d’un certain rang variable avec q. 
Remarquons d’ailleurs que 2,4: fait partie de 2, et que 21, D2, --- ont 
un seul élément commun savoir A. 
On est alors amené a la définition suivante. Soit une classe (LZ); on dira 


q 
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que c’est une classe (V) ou encore qu’on peut y définir wn voisinage* lorsque 
la condition suivante est réalisée: 

Pour tout élément A, il existe une suite d’ensembles U,, U2, ---, Ug, ++: 
telle que la condition nécessaire et suffisante pour qu’une suite {A,} converge 
vers A est que quel que soit q, la suite {A,} fasse partie de l’ensemble U, a 
partir d’un certain rang (variable en général avec q) . 

On remarque aussité6t, que s’il en est ainsi l’élément A est le seul élément 
commun U;, U2, ---, Ug, «++, et qu’en appelant 7, l’ensemble U, - U2 - 
des éléments communs a U,, U2, ---, U,, l'ensemble 7, est contenu 
dans et la suite des ensembles ---, T,, peut remplacer la 
suite des ensembles U;, U2, «--, Ug, 

Ainsi on peut supposer que U, appartienne 4 U,_,; et on doit se souvenir 
que A est le seul élément commun aux U,. Lorsqu’il y a lieu de distinguer 
les U, relatifs 4 un élément A de ceux relatifs 4 un élément B, on désignera 
les premiers par UY et les seconds par US”. 

Nous pourrons donc dire qu’un élément B est trés voisin de A lorsque B est 
dans un ensemble US d’indice trés grand. Ou encore nous pouvons dire 
que les ensembles UY, U{", ---, constituent successivement des voisinages 
de plus en plus étroits de A. 

9. Nous avons alors a résoudre la question suivante: soit une classe (L) ot 
les suites convergentes et leurs limites sont déterminées d’avance, est-il toujours 
possible d’assigner 4 chaque élément A une suite d’ensembles 7, TY, ---, 
déterminant des voisinages de plus en plus étroits de A, de sorte que les 
suites convergentes et leurs limites ne soient pas modifiées? 

Nous avons aussi a résoudre la question suivante: si dans une classe (V ) 
le voisinage autour de chaque élément est défini d’une certaine facon, est-il 
toujours possible d’assigner un écart 4 tout couple d’éléments de |’ensemble, 
de sorte que la convergence et la limite éventuelle d’une suite ne soient pas 
modifiées, qu’on définisse celles-ci par le moyen du voisinage ou par celui 
de l’écart? 

Nous allons voir que la réponse & ces deux questions est négative et nous allons 
déterminer quelles sont les conditions supplémentaires que doit vérifier la 
définition de la limite pour qu’on puisse la définir au moyen soit du voisinage, 
soit de l’écart. 

10. La réponse a la premiére question est négative, c’est a dire qu’une classe 
(L) étant donnée, on ne peut pas toujours y définir un voisinage* sans altérer 
la convergence ou la limite de ses suites d’éléments. 

Pour qu’une classe (L) puisse étre considérée comme une classe (V ), sans 
modifier la convergence des suites d’éléments ni les limites des suites con- 
vergentes, il faut et il suffit que: 


* J’emploie ici la notation ( V ) et le mot de voisinage dans un sens plus général que celui que 
avais adopté dans ma Thése. Voir aussi Note Supplémentaire B 4 la fin de cette mémoire. 
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1° toute suite convergente d’éléments de la classe (LZ) reste convergente 
quand on lui ajoute un nombre fini d’éléments de la classe; 

2° pour tout élément non isolé A de la classe, toute suite o d’éléments 
A’, AY’, «++, convergeant vers A et tout entier n, il existe un entier ¢? 
jouissant de la propriété suivante: Soient o1, a2, ---, une suite de suites 
(distinctes ou non) convergeant vers A et 71, m2, «++, une suite d’entiers. 
Si quel que soit l’entier N on peut trouver l’entier 7 tel que pour i > io, 
n; > alors la suite A”, converge aussi vers A. 

Ces conditions sont nécessaires. Car si la classe est (V ), la limite A d’une 
suite A,, deo, ---, est définie de la facon suivante. II existe des ensembles 
T,, «++, T,, +++, chacun dans le précédent, n’ayant en commun que 
élément A, tels que si Az, , converge vers A, cette suite est contenue 
dans chaque 7’, a partir d’un certain rang. S’il en est ainsi, cette condition 
sera encore réalisée si on ajoute un nombre fini d’éléments de la classe a la 
suite, donc la convergence et la limite de cette suite n’en seront pas modifiées. 
D’autre part soit ¢‘’ le rang du premier élément de la suite A\”, AY, ---, 
A’, +++, a partir duquel les termes de cette suite supposée convergeant 
vers A, sont tous compris dans 7',. Et supposons que quel que soit N on 
peut trouver 7% tel que n; > g¥ pour i > io. Alors A‘? est dans 7'y pour 
i > io; autrement dit, quel que soit N, la suite A‘, A, ---, appartient 
a 7 ya partir d’un certain rang 7%) variable avec N. Elle est donc convergente 
et sa limite est A. 

Inversement, les conditions 1° et 2° sont suffisantes pour que la classe (L) 
soit une classe (V ). 

En effet; tout élément B de la classe appartient 4 au moins une suite con- 
vergeant vers A (par exemple B, A, A, A, «++, A, +++); soit AW”, AY, 

-, A®,, B, +++, cette suite. Nous appellerons U“ l'ensemble 
des éléments B tels que dans une au moins des suites o tendant vers A et 
comprenant B celui-ci occupe un rang supérieur a g%’. Les ensembles U 
jouissent alors des propriétés voulues. Considérons d’abord une suite A’, 
AY, Al”, +++, convergeant vers A. Quel que soit n, l’élément 
occupera pour p assez grand (p= po) un rang supérieur a g”’. Done les 
éléments de cette suite 4 partir de A,, appartiennent 4 U“”. Inversement 
si les éléments de cette suite appartiennent quel que soit N 4 UY a partir 
d’un certain rang po (variable avec NV) c’est que pour p = po. A\” appartient 
4 au moins une suite (o,) convergeant vers A oi il occupe un rang r, > qf”. 
Autrement dit, étant donnée la suite 


( ) 
Ag», AY? ree, Av = AY tee, 


on peut pour chaque valeur de N trouver po, tel que p = po entraine r, > q'f” 
et alors d’aprés la condition 2° la suite précédente converge vers A. 


| 
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11. Les conditions 1° et 2° sont d’ailleurs indépendantes. 

On peut satisfaire 4 2° sans satisfaire 4 1° en prenant par exemple comme 
classe les points d’un segment BC extrémités comprises et en y définissant la 
limite comme d’ordinaire par la distance sauf que les suites comprenant le 
point B et dont la distance des points a B ne tend pas vers zéro seront toujours 
déclarées divergentes dans cette classe. Alors si on considére une suite con- 
vergeant vers A et si on lui adjoint B, elle cesse d’étre convergente; 1° cesse 
d’étre vérifié. Au contraire 2° est encore vérifié. Soit en effet une suite 
Aw, AY’, «++, convergeant vers A. On appellera ¢% le rang a partir 
duquel on a constamment AA‘? < 1/n. Alors si quel que soit N, on peut 
trouver io, tel que pour 7 > %, on ait n; > g¥°, c'est que AA? < 1/N 
pour? >i. Si A + B, aucune des suites o;, supposées convergeant vers A, 
ne contiendra B; par suite la suite A”, A, ---, non plus, et alors elle 
convergera vers A dans la classe. Si A = B, la présence de B dans les suites 
n’offre plus de difficulté. 

On peut aussi satisfaire 4 1° sans satisfaire 4 2°. 

Nous allons donner un exemple d’une classe (LZ) ov la limite est définie de 
facon a satisfaire aux trois conditions 1°, 2° du n° 3 et 1° du n° 10 sans qu’on 
puisse y définir un voisinage non seulement au sens de ma Thése mais méme 
au sens plus large adopté ici (n° 8). II suffit de montrer que la condition 2° 
du n° 10 n’est pas satisfaite. 

Cet exemple est celui d’une classe (Z) dont les éléments sont les fonctions 
f(x) uniformes sur le segment J (0 =z =1) et ow la limite est définie 
comme d’ordinaire. C’est a dire que f, (x) converge vers f(x) si ceci a lieu 
en chaque point de l’intervalle (uniformément ou non). Alors les trois con- 
ditions mentionnées plus haut sont satisfaites. I] faut montrer que la derniére 
condition 2° ne l’est pas.* 

Prenons pour simplifier le cas ot l’élément A est la fonction zéro (c’est a 
dire égale 4 zéro en tout point de l’intervalle J). Montrons que si pour 
chaque entier N et pour chaque suite o de fonctions convergeant vers zéro, 


fe (x), ff (2), «++, on détermine un certain entier g%, aucun choix de 
l'ensemble des entiers g% ne satisfera 4 la condition 2°. Considérons en 
particulier la suite o, formée des fonctions fi(2), 


de la maniére suivante. La fonction y = f} (2x) est de période 1/2" et elle est 
représentée par une ligne brisée dont les premiers sommets sont les points de 
coordonnées 


1 1 1 
(0,0); 2) (5-0); (5.0). 


On voit que les éléments de la suite ¢, convergent vers zéro quand p croit 
indéfiniment. A oa, et N correspond un certain entier g%; posons 


= Voir Note Supplémentaire C a la fin de cette mémoire. 
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Extrayons des suites ¢, la suite o: 


Pour chaque valeur de N on peut trouver 7 tel que pour n > 7 on ait pp, > q%. 
Il suffit de prendre i = N, car alors 


Pn > gr. 
Si done la condition 2° était vérifiée, la suite o convergerait vers zéro. 
Or on peut facilement trouver un point 2» ou fp, (x) ne tend pas vers zéro. 


En effet, on a évidemment 


1 3 
21 pour 1, 


= 


quand r est un entier quelconque prenant I’une des valeurs0,1,2,---,2" —1. 
Appelons J;,, lintervalle précédent. II suffit alors de prouver qu’il existe 
un point commun 4 une infinité des intervalles 
Or étant donné l’intervalle J5",,, prenons n’ = n + pp, + 2, puis r’ tel que 


Tn 1 Tn 3 

Alors I%,»,, est compris dans J%",,. On a done une succession d’intervalles 
pris dans la suite des J;*,, et chacun compris dans le précédent. Ils auront 
done un point commun 2. 

12. La réponse a laseconde question du n° 9 est aussi négative. Nous allons 
en effet donner l’exemple d’une classe (V ) d’éléments oi la limite est définie 
au moyen du voisinage et ne peut étre définie sans altération au moyen d’un écart. 

Considérons en effet la classe (V ) formée des points d’un segment ab et 
ot le voisinage d’un point d’abscisse xo est défini de la facon suivante. On 
appellera 75° l'ensemble des points du segment ab et 7*° (pour n entier = 1) 
l’ensemble des points du segment de longueur 1/n ayant 29 pour extrémité 
droite. Chacun de ces ensembles contient xo et contient le suivant . 

On voit évidemment que dans cette classe pour qu’une suite {x,} converge 
vers 2, il faut et il suffit que la distance géométrique 29 x, tende vers zéro 
et de plus que les points x, soient a partir d’un certain rang a gauche de 2 
(ou confondus avec 2). 

Il s’agit de montrer qu’aucune définition d’un écart (2, x’) dans cette classe 
ne peut conduire a la méme définition de la convergence. 

En effet supposons une telle définition possible. Alors si 2’ est un point 
arbitraire 4 droite d’un point fixe x |’écart (2, x’) aura une borne inférieure 
positive. En effet, dans le cas contraire, cette borne serait nulle et on pourrait, 
quel que soit m, trouver un point 2, 4 droite de x tel que (2, 2,) < 1/n. 
Par suite la suite {x,} située 4 droite de x, devrait converger vers x dans la 
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classe, ce qui est contraire a la définition actuelle de la limite. Soit done 
F(x) la borne inférieure positive considérée, bien déterminée pour chaque 
valeur de x dans ab. 

Pour tout point xo de ab et toute valeur de l’entier p, on peut déterminer 
un intervalle x) zo limité 4 droite 4 ce point zo et a l’intérieur duquel F (2x) 
<1/p. Car il suffit 4 cet effet de déterminer le point x, de sorte que pour 
x <2 <2, on ait (2,29) <1/p. Prenons en particulier x) = b; on a 
un intervalle 2; 6 dans lequel F(x) <1/p. Soit @ la borne inférieure des 
nombres 2) telle que dans l’intervalle x)’ b, on ait F(x) < 1/p, sauf peut 
étre en un ensemble dénombrable de points (il y a au moins un de ces inter- 
valles, A savoir x 6). Ona nécessairement a =a. Carsia < a, il suffirait 
de former a gauche de a comme plus haut un intervalle Ba dans tout |’intérieur 
duquel F(x) <1/p. Et alors l’intervalle 6b = Ba + ab serait un intervalle 
plus grand que ab, ot F(x) <1/p sauf en un ensemble dénombrable de 
points. 

Mais alors on voit que F (2) ne peut étre = 1/p qu’en un ensemble dé- 
nombrable E, de points de ab. Par suite F (2) ne pourrait étre positif que 
dans l’ensemble dénombrable de points qui est la somme des ensembles 
dénombrables E,; + E, + --- + E, + ---, et non en tout point de ab. 

13. On peut remarquer que dans le cas d’une classe (V ), on pourrait déter- 
miner une sorte d’écart dissymétrique. Appelons en effet ((A, B)) l’inverse de 
la plus haute valeur de n telle que le voisinage T7 contienne B. Ce nombre est 
=0, fini et bien déterminé si A et B sont distincts. Nous poserons ((A,B))=0 
si A et B coincident. On voit qu’en général on aura ((A, B)) + ((B, A)). 
Cependant, A étant un élément fixe, si B, tend vers A, ((A, B,)) ,—1mais 
non nécessairement ((B,, A)),—tend vers zéro, et réciproquement. 

14, Etant prouvé qu’une classe (V) n’est pas nécessairement une classe 
(E), nous allons déterminer maintenant @ quelle condition il en est ainsi. 

Nous supposons done que dans une classe (V), la limite soit définie en 
attachant 4 chaque élément A une suite d’ensembles 7%, TY, ---, TY, 
--+, définissant des voisinages de plus en plus étroits. Et nous cherchons a 
quelle condition doivent satisfaire ces ensembles 7“ pour que la limite ainsi 
définie puisse étre définie sans altération de la convergence ni de la limite 
par le moyen d’un écart (A, B) attribué 4 chaque couple d’éléments A, B, 
C, cet écart satisfaisant aux conditions 1°, 2° du n° 10. 

Cette condition est la suivante: il faut et il suffit que l’on puisse attacher a 
chaque élément A une suite d’entiers non décroissants et qui tendent vers 
Vinfini 
r4 


choisis de sorte que si l’on se donne arbitrairement un entier N et un élé- 
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ment A de la classe, on puisse déterminer un entier m pour lequel l’ensemble 
Te contient nécessairement |’élément A, si B appartient 4 7. 

La condition est nécessaire. En effet, soit une classe (V ) ow la limite peut 
aussi étre déterminée par le moyen d’un ecart (A, B). 

Comme on peut supposer que 7“ contient toute la classe, alors, pour 
chaque valeur de n, l’un au moins des ensembles 7(“, TS, ---, contient 
tous les éléments B tel que (A,B) <1/n. Si T® est précisément lun 
de ces ensembles nous poserons r4 = n; sinon nous appellerons r4 le rang le 
plus haut ( < n) de tous ces ensembles. 

On voit que dans tous les cas TA contient tous les éléments B tels que 
(B,A) <1/n et que r4 =r4,,. On voit aussi que r4 tend vers l’infini 
quand A est fixe; sans quoi il aurait une borne supérieure finie &. Et alors 
quel que soit n > k il y aurait en dehors de 74, un élément B,, tel que (B,, A) 
<1/n. Maisalors si la limite est définie par l’écart, B, tendrait vers A, et si 
elle est définie par le voisinage, B, ne tendrait pas vers A. 

Ceci étant donnons nous arbitrairement N et A. On peut choisir un 
entier m tel que pour tout élément B de 7%, on ait (B, A) <1/N. (Car 
dans le cas contraire, il y aurait dans chaque ensemble 7“ un élément C,, tel 
que (C,, 4) >1/N. Alors selon qu’on emploierait le voisinage ou l’écart 
C, tendrait ou ne tendrait pas vers A.) 

Or si (B, A) < 1/N, Te contient A. La condition est bien nécessaire. 

Inversement, supposons que dans une classe (V) l’existence des suites 
r4 satisfaisant aux conditions demandées soit assurée. Montrons qu’on peut 
aussi définir la limite dans la classe, sans altérer la convergence, au moyen 
d’un écart. 

Pour cela appelons qj la plus grande valeur de n telle que Ta contient B. 
Il y a au moins une telle valeur de n puisque Ta = 7 est formé de toute 
la classe. Alors qf est un nombre bien déterminé et fini si A et B sont distincts. 

Posons alors (A, B) = (B, A) = 0si A et B coincident et 


(A,B) =(B,A) =S 


dans le cas ot ils sont distincts. Alors on a défini pour tout couple d’éléments 
un nombre = 0 qui n’est nul que si les deux éléments coincident. II s’agit 
de montrer que si une suite B, tend vers A, (B,, A) tend vers zéro et récipro- 
quement. 

Tout d’abord si (B,, A) tend vers zéro, q%, croit indéfiniment; done, si 


(A) . 
N est donné, on a pour p assez grand (p > m): q%, > N; donc 7,4 contient 


B,, par définition de gg,. Mais si on s’est donné A et un entier a, on peut 
choisir N pour que rf > a et par suite 7 contient aussi B, a partir du rang m. 
Autrement dit, puisque a@ est arbitraire, B, tend vers A. 
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Inversement si B, tend vers A, et si a est un entier donné, B, est dans 
i partir d’un certain rang k; donc qj, > pourp >k. Done tend 
vers l’infini avec p. Pour montrer que (A, B,) tend vers zéro, il suffit 
done de montrer que q% tend aussi vers |’infini. Or dans le cas contraire on 
pourrait trouver un nombre positif \ tel que pour une infinité de valeurs de p, 
on ait g3? <2. Mais pour une telle valeur de p, Tt, ne contiendrait pas A. 
Or par hypothése, étant donnés A et d on peut trouver un entier m tel que 
si B, appartient 4 TY, Te contienne A. II faudrait donc que pour une 
infinité de valeurs de p, B, n’appartienne pas a 7‘, ce qui est contraire a 
la définition de la limite par le voisinage. 

15. Pour élucider complétement les relations entre les notions de limite et 
de distance, il reste 4 résoudre le probléme suivant: 

Nous avons vu que pour qu'une classe (ZL) soit une classe (D), il faut 
qu'elle soit (S) et (EZ). Ces deux conditions nécessaires, qui sont indé- 
pendantes, sont elles suffisantes? Et dans le cas de la négative, quelle est la 
nature de la condition supplémentaire? 

Ce probléme n’est pas résolu dans le présent mémoire. Nous nous con- 
tenterons des indications suivantes. 

16. On peut sérier les difficultés en passant du cas des classes qui sont a la 
fois (E) et (S) au cas des classes (DD) par l’intermédiaire des classes que j’ai 
appelées (V) dans ma Thése, notation a laquelle je n’attache plus ici le 
méme sens, le terme voisinage se trouvant plus adéquatement utilisé dans le 
présent mémoire. Nous appellerons ces classes non plus (V), mais ( E,), 
et nous dirons que l’écart dans ces classes est uniformément régulier. Les 
considérations suivantes vont expliquer la raison de ces dénominations et 
pourquoi les classes (,) s’introduisent naturellement dans la solution du 
probléme que nous venons de poser. 

17. Voyons done quelles seraient les propriétés d’une classe qui serait a la fois 
(E)et(S). Les conditions 1° et 2° vérifiées par une classe D restant remplies 
par une telle classe, il s’agit de savoir ce qui remplace la condition 3°. A cet 
effet démontrons le théoréme suivant: 

La condition nécessaire et suffisante pour qu’une classe ( E) soit (S), (autre- 
ment dit pour que tout ensemble dérivé soit fermé dans une classe ov la limite 
est définie a l’aide de l’écart) est qu’a tout élément A et a tout nombre positif 6, 
on puisse faire correspondre un nombre 7 tel que si B est un élément déterminé 
dont V'écart avec A soit moindre que n et si w est un nombre positif convenablement 
choisi une fois connus 6, n, A, B, alors pour tout élément C dont Vécart avec B 
est moindre que w, on aura* 

(A,C) <6. 

*On peut exprimer briévement cette condition en disant que si (A, B) et (B, C) sont 
des infiniments petits, (A, C) est aussi un infiniment petit. Mais la convergence n’est pas 
nécessairement uniforme. 
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En effet supposons d’abord que la classe soit (EF) et (S). Si le théoréme 
n’était pas vrai pour un élément A de |’ensemble et un certain nombre 6 > 0, 
la condition (A, C) < 6 ne serait pas nécessairement vérifiée et ceci quelle 
que soit la valeur adoptée pour 7. En particulier si l’on prenait » = 1/n, 
il y aurait au moins un élément B, tel que (A, B,) < 1/n, pour lequel aucun 
choix convenable de w ne saurait assurer la condition recherchée. C’est a 
dire que si l’on prenait par exemple w = 1/p, il y aurait un élément C®” tel 
que 


1 
(B,, CY) <w Dp et (A, CS") 


Mais, quand n restant fixe p croit indéfiniment, CS” tend vers B,. Et 
quand n croit indéfiniment, B, tend vers A. L’ensemble dérivé de l’ensemble 
des CS? comprend donc l’ensemble des B,. Et comme il est fermé par hypo- 
thése, il comprend aussi la limite A des B,. Or la condition (A, CS”) > 6 > 0 
montre que A ne peut étre limite d’une suite d’éléments C%. ' 

Réciproquement supposons que pour une classe (FE) la condition énoncée 
plus haut soit vérifiée. Alors tout ensemble E’ dérivé d’un ensemble quel- 
conque d’éléments de la classe est fermé. Autrement dit, si A est la limite 
d’une suite d’éléments B,, B., ---, Bn, de l'ensemble dérivé E’, A 
appartient a E’. En effet, chaque élément B, de FE’ est la limite d’une suite 
d’éléments C®, CY’, ---. Pour 6 donné, on peut trouver un nombre 7 tel 
que si (A, B,) < 7, il correspond a B, un nombre w, pour lequel la condition 
(B,, CS?) < w, entraine (A, CS”) <6. Or quel que soit 7 on peut prendre 
n assez grand pour que (A, B,) < 7, et quel que soit w,, on peut prendre 
pour n fixe p assez grand pour que (B,,C”) < wr. Les C%, CY, --- 
étant distincts ainsi que les B,, Be, ---, on voit qu’on peut finalement faire 
correspondre a toute valeur 1/7 de 6 deux entiers n;, p;, tels que les éléments 
cm, C&, +++ soient distincts et que (A, <1/i. Autrement dit, A 
appartient aussi a 

18. Si le nombre w ne dépend pas de |’élément B, alors en prenant pour e le 
plus petit des nombres 7 et w, on voit qu’a tout élément A et a tout nombre 
5 > 0, on peut faire correspondre un nombre ¢ tel que si B, C sont deux 
éléments satisfaisant 4 la condition 


(A,B) +(B,C) <e, etparsuitea (A,B) <n, (B,C) <a, 


on a toujours 
(A,C) <6. 


Appelons ¥( @, A) la borne supérieure, finie ou non, de (A, C) quand B, C 
sont des éléments quelconques tels que (A,B) +(B,C) =@>0. On 
voit alors que ¥(@, A) est un nombre > 0 qui tend vers zéro quand @ tend 
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vers zéro, A restant fixe. Car quel que soit le nombre 6 > 0 on peut trouver 
un nombre @ tel que si (A, B) + (B,C) <e€, ona (A,C) <4, et par 
suite si @< €, ¥(0, A) =6. Et ona quels que soient B et C 


(A,C) S=W((A, B) + (B,C), A). 


Si maintenant on suppose que ¥(@, A) est aussi indépendant de A, on 
obtient une classe qui est ce que nous appellerons une classe ( E,), ol on peut 
définir un écart uniformément régulier tel que l’on ait, quels que soient A, B,C, 


(A,C) Sy((A,B)+(B,C)). 


On voit que les classes (D), celles oti la distance est définie, correspondent 
au cas particulier des classes ot = @. Il faut remarquer d’ailleurs 
qu’on a toujours ¥(@) = @ comme on le voit en prenant C identique a B. 

Ces classes (E,) ne sont autres que celles que j’ai appelées classes (V ) 
dans ma Thése. On voit en effet que si l’on appelle ¢ (€) une fonction partout 
supérieure a y (2€) mais tendant vers zéro, on aura 


(A,C) < pour (4, (2, 


Notes SuppLEMENTAIRES.* 


A. Pour obtenir des distances infiniment petites, il suffirait de faire correspondre la classe 
4 un ensemble fini linéaire non isolé avec conservation de la distance par définition. Et au 
cas ot la puissance de la classe est supérieure au continu, de donner aux distances aboutissant 
aux éléments restants une valeur finie suffisamment grande. 

B. Dans une note récente aux Comptes-Rendus del’ Académie des Sciences 
(Septembre, 1917) je généralise une nouvelle fois en abandonnant la condition que les ensembles 
U forment une suite dénombrable. 

C. Il en résultera, en particulier, que dans cette classe si importante on ne peut definir un 
écart. On peut, d’ailleurs, donner de ce fait démonstrations directes plus simples. J’ai aussi 
fait remarquer, dans ma Thése, que cette classe n’est pas une classe (s) . 


*These notes were received from the author too late to insert with the pages to which 
they belong.—Tue Epirors. 
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ON THE FOUNDATIONS OF THE CALCUL FONCTIONNEL OF 


FRECHET* 


BY 
A. D. PITCHER AND E. W. CHITTENDEN 


In his thesis,t Fréchet gave a very beautiful generalization of the theory 
of point sets and of the theory of real valued functions of a real variable. 
His functions are real valued but the range of the independent variable is 
an abstract class © of elements g. He secures his results principally through 
the medium of a properly conditioned distance function 6, a generalization 
of the distance between two points, which associates with each pair qi q2 of 
elements a real non-negative number 6(q: q2). He is thus enabled to secure 
the more important theorems of point set theory and of real function theory, 
especially those relating to continuous functions and their properties. This 
theory of Fréchet has excited considerable interest and has received much 
attention from mathematicians. Various contributions to its foundations and 
to its content have been made. 

In the present paper we follow the example of Fréchet in assuming once for 
all that 6(qq) = 0 and that 6(q: g2) = 6(q2q:). In other words we assume 
that the distance from an element to itself is zero and that the distance between 
two elements is independent of the order in which they are taken. In the 
first part of the paper we give very simple conditions on systems (22; 6) which 
are sufficient for many purposes and which, in the case of compact sets, we 
show to be equivalent, so far as limit of a sequence is concerned, to the voisinage 
and thus to the écart of Fréchet.{ The remainder of the paper is devoted to 
the theory of functions on the sets Q of systems (€Q; 6). In terms of the 
conditions already mentioned we generalize the results of Fréchet and Hahn 
as to the existence of non-constant continuous functions. We give two very 
- * The results of this paper have been presented to the Society at various times, especially 
April 2, 1915, and December 28, 1916: 

t Sur quelques points du calcul fonctionnel, reprinted in Rendiconti del Circolo 
Matematico di Palermo, vol. 22 (1906), pp. 1-74. 

See also, A contribution to the foundations of Fréchet’s calcul fonctionnel, by T. H. Hilde- 
brandt, American Journal of Mathematics, vol. 34 (1912), pp. 237-290. 

t E. W. Chittenden has proved that the voisinage and the écart of Fréchet are equivalent 
so far as the limit of a sequence is concerned. Cf. these Transactions, vol. 18 (1917), 


pp. 161-166. 
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mild conditions on systems ({2;6) which are completely independent and 
which secure for all continuous functions on © the important properties of 
the continuous functions of a real variable. Also we give a set of three com- 
pletely independent conditions (the two above and one other) which we show 
to be both necessary and sufficient for a so-called uniformly proper theory of 
continuous functions where the term uniformly proper is used in a sense 
likely to be admitted by any one who gives the matter careful consideration. 


1. INTRODUCTION 


By the notation (QQ; 6) we denote a set © of elements q and a function 6 
which assigns to each pair q; g2 of elements a real number 6(4q: q@) = 0. 
We also assume, once for all, that for every element ¢, 6(qq) = 0, and that 
for every pair qi, 92 of elements 6(q: q2) = 6(q249q1). We follow Fréchet in 
saying that q is a limit of the sequence gn, Lx qn = q, when and only when 
Ln6(qnq) = 0. We shall be interested in the following properties of 6, or of 
systems 6). 

(1) If 6(q1 q2) = O then qi = 

(2) If Ln gin = gand Ly (qin Gon) = O then La don = 

(3) If La din = 7 = Ln Gon then Ln b( Gin gon) = 0. 

(4) If Ln (qin Gon) = O and Ln (don dan) = O then Lind (Gin Yan) = 0. 

(5) There is a function @(e) such that L.-o¢@(e) = 0 and such that if 
§(q1 92) Se, 6( 9293) Se then 6(q 93) = o(e). 

(6) 6(q1 92) + 93) = 6(q 93). 

It will be seen at once that (2), (3), and (4) are important properties which 
are implied by (5) or (6). We will show that (2), (3), and (4) play a funda- 
mental réle. 

The notation 6" denotes the fact that 6 has the property (n). Thus 6” 
denotes a 6 which possesses the properties (1) and (3). The voisinage of 
Fréchet is a 6” and the écart a 6'*. 

Our terminology is that of Fréchet except that we do not wish to imply 
that the limit of a sequence is unique. Thus a set Q of the set Q of a system 
(QQ; 6) is said to be compact in case every sequence of distinct elements of QD 
gives rise to at least one limiting element. D is closed in case every limiting 
element of Q is of Q. It should be noted that, in a system (QQ; 6) where 
5 (q1 g2) may be zero for q; and q distinct, an element q may be the limit of a 
sequence composed of a single element, other than q itself, repeated infinitely 
often. If Q is compact and closed then © is said to be extremal. ‘For the 
case of non-unique limits it is desirable to take special note of classes Q which 
may not be closed but which are such that every sequence {q,} of Oo, which 
has a limit, has a limit in Q. Such sets are said to be self-closed. If a set OQ 
is compact and self-closed it is said to be self-compact. The property (2) will 
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prove to be of fundamental importance and, for lack of a better term we 
venture to call a system (0; 5) a coherent system.* A system (9; 56) is 
said to be limited if there is a positive number h such that for every q: q2, 
6 Q)= h. 

We may have two systems (9; 6) and (Q; 6), the set Q being the same 
in each case but the two distance functions, 5 and 6, not the same. If we 
wish to indicate that q is a limit of the sequence {q,} we write, in the first 
case, Lngdn = q, and in the second case, we Qn =q. Two systems (Q; 56) 
and (Q; 4) are said to be L-equivalent in case Ly gn = q implies Ln Qn = 9 
and conversely. 

Denote by $ a class of elements p. If 8 is a subclass (reductiont) of Q 
of a system we denote by 6) a reduced system where 5 (1 p2) 
= 6(4q: G2) in case py; = gq; and pe = q.. It will be noted that, if a system 

©); 6) has any one of the properties (1) --- (5), then any reduction of (9; 6) 
has that property also. 

We speak of extremal systems (0; 6), closed systems (©; 6), etc., in an 
obvious way. A compact system is a reduction of an extremal system. 

Two elements ¢, q of a set Q of a system (Q; 6) are said to be connectedt 
by 6 if for every e there is a finite sequence q1-, doe, *** » Ye, e Such that qie = 7, 
= q and such that 6 (qie = 1,2,---,k, —1. The set 
is a connected set, or the system (0; 4) is a connected system, in case every 
pair 9: 2 of elements of Q is connected by 6. Two classes 21, Qe of O are 
directly connected if for every e there is a qie of QQ); and a qe, of Qe such that 
5 (die ae) Se. If Qe is a single element gq. and 21, Dz are directly con- 
nected, then gz must be an element of Q, or else a limit of a sequence of 
elements (not necessarily distinct) of ,. If two elements q:, q2 are directly 
connected then 6(q: g2) = 0. 


2. Systems (Q; 6) 


THEOREM 1. For every coherent system (90; 6) there is an \-equivalent 
coherent system (Q; 6) .§ 

In proof of this theorem 5(q: 42) is defined to be d, the greatest lower 
bound of all d such that there is an element rg of © such that 6(qi: ra) Sd 


* The importance of coherent systems is apparent from the fact that in such systems the 
derived class of every class is closed, cf. E. R. Hedrick, these Transactions, vol. 12 
(1911), p. 285. : 

+ Cf. E. H. Moore, Introduction to a Form of General Analysis, §52. New Haven Mathe- 
matical Colloquium, Yale University Press, 1910. 

t Cf. On the connection of an abstract set, etc., by A. D. Pitcher, American Journal 
of Mathematics, vol. 36 (1914), pp. 261-266. 

§ An important consequence of this theorem is: In a coherent system two sequences which 
have a common limit have all limits in common. 
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and 5(raq2) =d. It is at once evident that 6(qq) = 0 and that 


5(q1q2) = 


5(q1 92) S e implies 5 ( 91 92) Se, since r, may be taken as q, org.. There- 
fore Lr dn = q implies Lu dn = 

We will now show that Ln qa = q implies Ln qn = q. It is convenient for 
this purpose and for the sequel to prove the following lemma. 

Lemma 1. If bn 6 (qin Gon) = 0 there is a sequence of elements rn, 
where for any given n, r, may be identical with qin or don, such that 


LnO(qintn) = 0 and Ln Jon) = 0. 


For every ¢ there is an n, such that n = n, implies 5 (qin Qon) Se. From 
the definition of 6 there is for every n = n, an element r,,. such that 


1 
(qin tne) Se +> and (Tne Gan) SO +—. 


Let e take on a sequence of values {e,} such that L; e, = 0 and consider the 
sequence {m,,} which may be taken so that n,. < < <M, < 
Now if n < m,, fn may be taken at random. If n, Sn < my41, take r, to 
be fne,- It is clear that such a sequence {r,} satisfies the given conditions. 

In case Ln dn = q we have L,6(qnq) = 0, and therefore by the above lemma 
there is a sequence {r,} such that L»d(q.7r,) =Oand L,6(r.q) =0. There- 
fore Ln fn = q, and since (Q; 6) is coherent, =q- Therefore Lndn = 4 
implies = q- 

The system (Q; 6) is coherent. For suppose 


Lain = and (qin Gon) = 


Since (Q;8) and (Q;4) are L-equivalent, Ln»gin = q. By the lemma 
there is a sequence {r,} such that Lrd(gintn) = 0 and gon) 
Since (Q; 6) is coherent L»t, = q and Lagden =q. But (Q;6) and (Q; 5) 
are -equivalent and thus qon = q- 

The system (Q; 6) has the property (3). For suppose Ln gin = 7 = Lin Q2n- 
Since (QQ; 6) and (OQ; 4) are L-equivalent Ladin =Lnden- Then for 
every e there is an n, such that n = n, implies 6(qinq) Se and Se. 
By definition 5 (qin Se. Therefore Ly 5 (in Gan) = O. 

THEOREM 2. Every limited system (Q; is a system 5"). 

We wish to show that there is a ¢(e) such that 


(a) Lo(e) =90, (b) Se, 5(q293) Se 


implies =@(e). Given e, consider all possible g2q3 such that 
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5(q192) =e, 6(4293) Se. Denote by ¢(e) the least upper bound of all 
5(q:93). It is obvious that ¢(e) exists, since ({Q; 6) is a limited system. 
It is also true that L..o@(¢) = 0. Suppose this is not true. Then there is a 
positive number a such that for every positive number d there is an e = d 
such that @(e) >a. Thus, there is a sequence {e,} such that Lie, = 0 
and such that, forevery n,@(e,) >a. Thus there are sequences {qin}, {qon}, 
{gan} such that Ln6(qin gon) = 0, Ln Gen = 0 and yet for every n 

5 (qin Yan) 2a. This contradicts the hypothesis that we have a system 
6*). 

TuEorEM 3. For every system (0Q;6) there is an L-equivalent limited 
system (2; 6). 

For if 


92) = 1+ 


(Q; 6) and (Q; 4) are L-equivalent and 5(q gz) is always less than unity. 
As a consequence of Theorems 2 and 3 we have the following theorem. 
THEOREM 4. For every system (9; 6') there is an L-equivalent system 

(OQ; 

Tureorem 5. If (QQ; 6) is a compact system, a reduction of a coherent system 
(Q; 5), then (Q; 4) is a coherent system (Q; 5"). 

(Q; 5) is coherent and has the property (3) since it is a reduction of (Q; 6). 
Suppose (Q; 6) does not have the property (4). Then there are sequences 
{din}, {Gen}, {Gan} such that Ln don) = 0, Ln (Gon Gan) = 0 and yet 
Ln 6(qin Yan) #0. There are, then, sequences {q1n,}, {@sn,} and an e such 
that for every k, 6 (Gin, Ysn,) > €. Since (Q; 6) is compact there is a sequence 
of {qin,}, Say {Gin,,}, Which has a limit q (the sequence {qin,,} may be merely q 
repeated infinitely often). Corresponding to {qin,} there are sequences 
{qan,} and {qsn,} such that Lid (qin, Yen,,) = 0 and Lid (qan, qan,) = 0. 
Therefore, since ({Q;6) is coherent, g = Li q2ny, and q= Then, 
since (QQ; 6°), also L, 6 (qin, qan,,) = (0. But this contradicts the statement 
that, for every k, 5 (qin, Yan, ) > 

TueoreM 6. If (Q;6) ts a coherent system there is an \-equivalent, 
limited, coherent system (2; 5) such that every compact reduction (2; 6) of 
(2; is a system (Q; &) 

This theorem follows at once from the successive application of Theorems 
1, 3, 5, 2 and the fact that a reduction of a limited system is limited. It will 
be noted that to every compact reduction (Q; 8) of (Q; 6) corresponds an 
L.-equivalent compact reduction (0; 6) of (Q; 6). An important special 
case of the theorem occurs when (Q; 5) is itself a compact system. 


THEOREM 7. If (QQ; 6') is a coherent system then there is an -equivalent, 
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limited system (Q; 5) such that on every compact set QD 2 of Q, 58 is a voisinage, 
e., the system (Q; 5) is a system (OQ; 6%).* 
This theorem follows from Theorem 6 and the fact that the property (1) 
is undisturbed by the transformations used in establishing the above theorems. t 


3. THE FUNCTIONS OF HAHN 


Fréchett proved that if qo and q; are two elements of the set © of a system 
(Q; 6°) then there is a function 4, continuous on the set © of the system 

©;6), such that u(qo) = 0, w(qi) = 1 and such that + qo implies 
0 <u(q) =1. Hahn§ proved the same theorem for a system (Q; 6"). 
The following is a goneratiantion of this theorem of Fréchet and Hahn. 

THEOREM 8. If Qo and Q, are two subsets of the set Q of a coherent system 
(2); 6), which are not directly connected, then there is a function yw, continuous 
on the set Q of the system (220; 6), such that: w(q) = is of Qo, u(q) = 
if q is of Qi, and 0 < w(q) = 1 and Qo are not directly connected. || 

The proof follows that of Hahn, but owing to the weaker hypotheses here 
employed, it seems desirable to indicate our proof. 

On account of Theorem 1 we may assume that our system is a system 
(Q; 6%). Since Qo and Q, are not directly connected there is an e; such 
that if qo is of Qo and q; of Q, then 6(qo qi) 2 e1. Denote by the class 
of all q’s such that for every go, 6(¢0q) 2 e1. SQ2 contains and Qo 
have no common limiting elements. Denote by $ the class of all elements 
which belong neither to Qo nor toQ2. Thus © is divided into a sequence of 
three classes Qo B Qe such that only adjacent classes may have common 
limiting elements and such that if 6(q’q’’) = 0 then q’, q’”’ belong to the 
same or to adjacent classes. 

A sequence OQ; Dizi of classes, which is such that only 
adjacent classes may have limiting elements in common and such that if 
5(q: 92) = 0 then q; and q2 belong to the same or to adjacent classes, we call a 
Hahn sequence. Consider the class Q:; (2 +0,m). Denote by do, the 
greatest lower bound of 6(q0q) where q is of Q; and go ranges over Q;-1. 
Denote by d,, the greatest lower bound of 6(q: q) where q: ranges over O;41. 

* The following is an example of a system ( 0; 6) which is of interest here: 

= allreal numbers x such thatO=2=1; w<1; 
6(01) =6(10) =1; 
=6(271) =i; 6(1z) =1/z 
The system (QQ; 6) is extremal, coherent, perfect, but not limited. 

} From Theorem 7 and the work of Chittenden (loc. cit.) it follows that in compact sets 
coherence and écart are infinitesimally equivalent. 

t Loc. cit., § 51. 


§Monatshefte fir Mathematik und Physik, vol. 19 (1908), pp. 251. 
|| It is obvious that u (q) = 0 if Qo and q are directly connected. 


3 

j 

4 


72 A. D. PITCHER AND E. W. CHITTENDEN [January 


If dog = dig assign q to a class Qio. If dog > dig assign q to a class Diz. 
Denote by D this principle of division. 

Lemma 2. The principle of division D transforms a Hahn sequence into a 
Hahn sequence. 

In proof of this we must show that in the new sequence of classes only adjacent 
classes may have common limiting elements. It is clear that we need only 
show that OQio and Oj4; have no common limiting elements and that Qi 
and 2; have no common limiting elements. Suppose Ojo and O;4; have a 
common limiting element g. Then there is a sequence {gon} of OQiio and a 
sequence {qin} of OQi4: which have q for a limit. Then from 6° it follows that 
Ln 5 (don Jin) = 0 and from the way the division of 2; was effected there must 
be a sequence of such that La gon) = 0. Therefore Lagn = q 
and Q;-1, OQi4: have a common limiting element. This however contradicts 
the fact that the original sequence Qo --- OQ; --- Qm is a Hahn sequence. 
Similarly we may prove that QO; and O;_; have no common limiting elements. 

We must also see that if 5(q: g2) = 0 then q; and gz belong to the same or 
to adjacent classes. It is obvious that we need only see that Ojo and Oi41 
can not contain elements gio and q,4; respectively such that 6(qio qi41) = 0 
and that 2; and Q,_; can not contain elements q,;; and q;_; respectively such 
that 6(qi qi) = 0. Suppose 6 (gio gis1) = and that gio is of and g.41 
of O41. Then, from the way the division is effected, Q,_; must contain an 
element gi-;: such that 6(q:-1 qi) = 0. Then, since we have a coherent 


system, 6(q:-1 9.41) = 0, which again contradicts the rene that the 
original sequence is a Hahn sequence. Similarly for OQ and Q;1. Thus 
the lemma is proved. 

Now the above sequence OQ» $ OQ» is a Hahn sequence and by use of the 
principle of division D a succession of Hahn sequences may be formed of 
which the first one is 


Qo Bo Bi Qe, 
Qo Boo Bor Bio Bu Qe, 


and the second one, 


and the kth one, 


Do Poo --- 00 Poo... or $i, *** Bir... 10 Pir... OD 


It is not difficult to show that if q is ina class Q,, ;, ... ;, and if {qn} is a sequence 
of elements, distinct or not, such that L,qg, = q then there is an n,; and a class 
adjacent to Q,, i, ... ;, such that for n = q, belongs to Q,, i, ... or to 
this adjacent class.* Otherwise the fact that at every stage the sequence of 
classes is a Hahn sequence is contradicted. 

Thus to each element q not of Qo nor of Q,z is associated an infinite sequence 


* It is understood here that the class Oa, & «.. 4, may be the class Qo or Qe. 
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of numbers, 
225 O85 °° * » °°? (% =Oorl), 


the first k of which are identical with the subscript to the class of stage k to 
which q belongs. Define a function py; as follows: ui(q) = 0 if qg is of Qo; 
wi(q) = 1if qis of 


i 
tet: 


if q is of $. That yw; is continuous is readily seen from the foregoing and 
from the fact that if gq; and qs are of adjacent classes of stage k then 


1 


Now consider a sequence {e,} of numbers e,, of which e; is used above, 
such that for every n, @n-1 < €n < @ny1 and such that L,e, = 0. Corre- 
sponding to e; we have yu, corresponding to ¢2 we have w2, etc. The function 
= uw, /2" is a continuous function having the desired properties, viz., 
u(q) = Oifqisof Qo,u(q) = 1 <u(q) =1if Qoandq 
are not directly connected. Obviously Q» and 2 may be classes each con- 
sisting of a single element. 

THEOREM 9. A necessary and sufficient condition that every continuous 
function on a class Q of a class Q of a coherent system (SQ; 6) be bounded and 
assume its bounds is that Q be self-compact. 

That the condition is sufficient is easily proved. The necessity of the con- 
dition may be proved after the manner of Hahn.* The reader who has 
grasped the significance of the condition (2) on 6 as exemplified above will 
have no difficulty in carrying out the proof of Hahn under these milder 
hypotheses. 


4. BIEXTREMAL CONNECTED SYSTEMS (Q; 6) 


A system (0; 6) is said to be biextremal in case it is true that when two 
sequences {qin}, {gon} are such that L» 6 (qin Gon) = 0 then there is a pair of 
sequences {q1n,}, {G2n,} (subsequences of {qin}, respectively) which 
have a common limit. 

THEOREM 10. Every biextremal system (0; 6) is also extremal. 

For consider sequence {q,} of distinct elements. Then L,»6(qnq,) = 0 
and there is a sequence g,, which has a limit q. 

THEorEM 11. Jf a function pu is continuous on a set Q of a biextremal 
system (20; 6), then is uniformly continuous. 

For suppose » were not uniformly continuous. Then there must be an e 

* Loc. cit. 
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such that for every n there is an element q;, and an element q2, such that 
5 (qin Yan) <1/n and yet |u (qin) — w(qon)| Ze. Now bn (qin gon) = 0 
and since (Q; 6) is biextremal, there is a pair of sequences {q1n,}, {G2n,} 
which have a common limit, say qo. The continuity of y at qo affords a con- 
tradiction. 

TuroreM 12. If a function pw is continuous on the set Q of a biextremal, 
connected system ({Q;6), then w is uniformly continuous, bounded, assumes 
its bounds and every value between these bounds. 


This theorem follows from Theorems 10 and 11 and Fréchet,* page 8, 
corollary, and the paper by Pitcher already cited, page 264, Theorem III. 
Thus every continuous function on a set © of a biextremal connected 


system (20; 6) possesses the important properties which we usually associate 


with a continuous function on a closed interval or region. However in a 
particular case there may be very few such continuous functions. Indeed it 
may happen that only constant functions are continuous. For example let 
©) be any set of elements and for every q;, q2 let 6(qi q2) = 0. The system 
thus defined is biextremal and connected. However every function continuous 
on the set Q of (4; 4) is constant. Certainly a proper set of continuous 
functions on a set © should, in general, contain functions other than the 
constant functions. 


5. Systems (€1; 6) ADMITTING A UNIFORMLY PROPER CLASS OF CONTINUOUS 
FUNCTIONS 

We will say that the class of all continuous functions on the set Q of a 
system (1; 6) is a uniformly proper class in case the following conditions are 
satisfied. 

(A) Every continuous function on Q is bounded and assumes its bounds. 

(B) Every continuous function on © is uniformly continuous. 

(C) Every continuous function on 2 assumes every value between each 
pair of its values. 

(D) If Qo and Q, are two classes of OQ which are not directly connected, 
there is a function », continuous on the set © of the system (QQ; 6) such that 
u(q) = Oif gis of Qo,u(q) = Lifqisof Qi,and0 <yw(q) = 1if gand Qo 
are not directly connected. 

If + qe there is a continuous function such that w(qi) + (qe). 

In the sequel we give a set of conditions on ({Q; 6) which are necessary and 
sufficient that the class of all continuous functions on Q be a uniformly proper 
class. This set of conditions thus gives rise to the uniformly proper theory 
of continuous functions mentioned early in this paper. 

A system (©; 6) is said to be L-wnique in case no sequence {qg,} has more 
than one limit q. 


* Loc. cit. 
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THEOREM 13. A biextremal, -unique system (0; 6) is a coherent system. 

For suppose = Ln qin and Lan (qin gan) = Oand g + Laden. Then there 
is an e and a subsequence {q2n,} of {q2.} such that for every k, 6(qon,q) > e. 
But Li 6 (qin, Yen,) = 0 and, since ({Q; 6) is biextremal, there must be se- 
quences {qin,,} and den, } of {qin,}, {G2n,} respectively which have a common 
limit. Since ({Q;6) is L-unique this limit must be g. Thus we have a 
contradiction. 

TueoreM 14. The class of all continuous functions on the set Q of a biex- 
tremal, connected, \_-unique system (2; 6) is a uniformly proper class. 

This theorem is a consequence of Theorems 13, 12, and 8. In applying 
Theorem 8 we should note that if a system (©; 6) is L-unique then 6 (q gz) 
= 0 implies q; = q2 and no pair of classes each consisting of a single element 
can be directly connected. 

THEOREM 15. If a system (90; 6) has the property E then (2;6) is L- 
unique. 

For if there be a sequence with two limits q; and q2 and if uw is a continuous 
function then w(qi) = (qe). 

THEOREM 16. If a system (20; 6) has the properties A, D, E then (Q; 6) 
is an extremal system. 

By the previous theorem (0; 6) is an L-unique system. Suppose (Q; 6) 
were not an extremal system. Then there is a sequence {q,} of distinct 
elements with no limiting elements. Divide {q,} into two classes: 


Each of these classes is closed, they are not directly connected, and, since we 
have L-unique, neither class is directly connected with any element not in 
the class. Then, by the condition D, there is a continuous function yu; such 
that wj(qi) = 1, (0S 9); = 0 =1 for not 
of {qn}. 

Now consider the function p = 2J=? y;/2’, which is nowhere 0, and which 
is such that w(qi) = 1, w(q2) = 3, = 1/27", ---. Here 1/u 
is well defined, continuous, but not bounded. This contradicts the hypothesis 
that (Q; 6) is a system with the property A. 

THEOREM 17. If a system (92; 6) has the property C then (2; 6) is a con- 
nected system. 

This theorem is proved in the paper by Pitcher referred to above, page 265, 
Theorem Vb. 

THEOREM 18. Jf a system (2Q;5) has the properties A, B, D, E, then 
(QQ; 6) is biextremal. 

For suppose (9; 6) is not biextremal. Then there is a pair of sequences 
{qin}, {Gen} such that 6(qin Gon) = 0 but such that no sequences {4q1n,}, 
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{q2n,} have a common limit. We may suppose that neither {q:,} nor {qen} 
contains a single element repeated infinitely often. For suppose {qi,} con- 
tains a single element q repeated infinitely often. Then this element q forms 
an identical sequence {q1n,}, Gin, = q for every k. Moreover the sequences 
{qin,} and {qon,} have a common limit, the element q itself. This contradicts 
the statement that no sequences {q1n,}, {2n,} have a common limit. By 
Theorem 16, the system (0; 6) is an extremal system. Then there is a 
sequence {qin,}, a subsequence of {4q:,}, which has a limit q,. The sequence 
{qen,} contains an infinity of distinct elements and, since ({Q; 6) is extremal, 
there is a subsequence } of {q2n,} which has a limit Thus 


L (ding, ) 0, L Jin, is L any, = 


By Condition B a function », continuous on Q, is uniformly continuous. 
Therefore (qin, ) | Also Lin (qin) and 
Lip (qen,) =«(g2). Thus if is continuous =4(q2). But this 
contradicts the hypothesis that ({Q; 6) has the property FE. 

From Theorems 14, 15, 17, 18 we have the following theorem. 

THEOREM 19. A necessary and sufficient condition that the class of all con- 
tinuous functions on the set Q of a system (<2; 6) be a uniformly proper class is 
that (2; 6) be biextremal, connected, L -unique. 

The significance of this theorem is emphasized by Theorem 7 and the 
theorem of Chittenden, previously referred to, that for every system (OQ; 6") 
there is an L-equivalent system (Q; 6'*). 


6. INDEPENDENCE CONSIDERATIONS 
The notion of the complete independence of a set of properties or postulates 
has been introduced into the literature by E. H. Moore.* We give here a 
set of eight examples of systems ({2;6). Each example is preceded by a 
combination of plus and minus signs which indicate the character of the ex- 
ample as to the properties biextremal, connected, L-unique. Thus (+ — +) 
denotes that the following system (90; 6) is biextremal, not connected, 
L -unique. 
(+++) = all real numbers zx such that 0 =z = 1. 
5(a1a%2) = — 2}. 
(—++) all real numbers x such that 0 < x 
6 (2 Xe) | ary a2|. 
all real numbers zx such that 0 = x 


6 (21 22) = | ary 


ug 
* Loc. cit. 
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(+ + —) = all real numbers z such that 0 = x = 1 and an element *. 


5 (2122) = — 5(rx) = |x 1). 


(— — +) Q = all real numbers z such that 0 < x = 1 or 2 
6 (21 ae ) = | at 


(+ — —) Q = all real numbers z such that 0 = x = 1 or2 =x =3 and 
an element *. 22) = — = | 


(— + —) Q =all real numbers z such that 0 < x = 1 and an element x. 


5 (2122) = |a, — = |z -—1]. 


(— — —) Q = all real numbers z such that 0 < x = 1lor2 =2=3 and 
an element *. 6(212%2) = |a1 — %2|; 6(a*) = |x 1). 

From these eight examples we have the following theorem. 

THEOREM 20. The properties biextremal, connected, -unique of systems 
(21; 6) are completely independent. 


7. ANALYSIS OF THE PROPERTIES BIEXTREMAL AND COHERENT 


We have already seen that every biextremal system (©; 6) is extremal and 
that every biextremal, L_-unique system (2; 6) is coherent. It is also true 
that every coherent, extremal system (2; 6) is biextremal. For suppose we 
have a coherent, extremal system (Q; 6) and two sequences {qin}, {gen} such 
that Ln 6 (qin Gen) = 0. We wish to show that there is a pair of sequences 
{qin,} {Gen,} Which have a common limit. If either {q:,} or {qon} contain a 
single element repeated infinitely often our contention is established at once. 
If such is not the case, then, since ({Q; 6) is extremal, there is a sequence 
{qin,} Which has a limit g. But since (Q; 6) is coherent and 


(qin, ) = 0, 
we have also Li don, = q- 

THEOREM 21. For L-unique systems ({9Q;6) the property biextremal is 
equivalent to the two properties extremal and coherent. 

Thus the four properties: extremal, connected, L-unique, coherent serve 
equally well as the basis of a uniformly proper theory of continuous functions 
and in fact are both necessary and sufficient for the same. The reader can 
show, without serious difficulty, that these four properties are also completely 
independent. 

The following properties are interesting and important. 

(a) Every sequence {q,} contains a sequence {qn%} which has a limit. 

(b) If a sequence {q,} has a limit and Ly» 6(qin gon) = 0, then {qin} and 
{q2n} have subsequences {q1n,}, {2n,} which have a common limit. 
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(ec) If q is a limit of a sequence {q:,} which has a limit in common with a 
sequence {q2,} then q is a limit of {4q2,}. 

TneoreM 22. The property biextremal is equivalent to properties (a) and (b) 
and the property coherent is equivalent to properties (b) and (ce). 

The first part of the theorem is readily proved. It is also not difficult to 
see that every coherent system possesses the Property (b). To see that every 
coherent system possesses the Property (c) note that if q: = Lxaqin and 
Lin Jin = Jo = Ln Yon then 6(qi qo) = O and qi = Laden. Also if (0; 6) has 
the Properties (b), (c) and if Ladin = ¢ and Lin 4 (qin Gon) = O then there is an 
element qo and sequences {qin,}{Qen,} such that Le qin, = Jo = Le Then 
from (c) q is a limit of {qo,,}. The supposition that q¢ is not a limit of {q2n} 
evidently leads to a contradiction. Thus properties (b) and (c) imply the 
property coherent. This theorem focuses on the importance of the properties 
(b) and (ec) and suggests sets of basic properties other than those given above. 

WesTERN Reserve UNiversity, CLEVELAND, OHIO 
UNIversITY OF ILLINOIS, CHAMPAIGN, ILLINOIS 


4 

4 

a 


EXISTENCE THEOREMS FOR THE GENERAL REAL SELF-ADJOINT 
LINEAR SYSTEM OF THE SECOND ORDER* 


BY 
H. J. ETTLINGER 


INTRODUCTION 


Sturmf in 1836 established many fundamental theorems concerning the 
properties of solutions of the linear differential equation (1) below, and the 
system formed by (1) and the boundary conditions (3), of which the oscillation 
theorems of § 1 of the present paper are immediate consequences. 

In the special case of periodic conditions Masont and Bécher§ with more 


or less generality established an oscillation theorem. 

Birkhoff|| extended their work to the general self-adjoint linear boundary 
conditions (see (5) and (1)), where, however, he assumed K = 1, and that A 
does not enter into the boundary conditions, and established an oscillation 
theorem for u,(2), the solution corresponding to the pth characteristic 
number. 

It will be the object of this paper to generalize these results to the most 
general real, self-adjoint linear system of the second order, where K and the 
coefficients of the boundary conditions are functions of A, by extending 
Bécher’s and Birkhoff’s methods,{ which are based on the application of 
Sturm’s theorems to this system.** 


* Presented to the Society, under a different title, April 26, 1913. 

TJournal de mathématiques, vol. 1 (1836), pp. 106-186. 

tMathematische Annalen, vol. 58 (1904), these Transactions, vol. 7 
(1906), pp. 337-360. 

§Comptes Rendus, vol. 140 (1905), p. 928. 

|| These Transactions, vol. 10 (1909), pp. 259-270. 

{| Some of the theorems obtained by Birkhoff have been worked out independently and 
by other methods by Haupt. See Dissertation, ‘‘ Ueber Oszillationsiheoreme,”’ Teubner, 
Leipzig, 1911. See also Mathematische Annalen, vol. 76 (1914), pp. 67-104. 

After a report of my results had been published (see Bulletin of the American 
Mathematical Society, vol. 19 (1913), p. 502), the case where K is a function of 
and the coefficients of the boundary conditions are constants, together with the corresponding 
system in difference equations was partially treated by Fort, “‘ Linear difference and differential 
equations,” American Journal of Mathematics, vol. 39 (1917), pp. 1-26. 

** The writing of this paper has been furthered by stimulus and suggestions from Professors 
Birkhoff and Bécher, to whom I desire to express my cordial appreciation. 
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1. ON A STURMIAN BOUNDARY PROBLEM 


Given the second order linear differential equation, 


du 


let us consider the following linear combinations of a function u(2,\) and 
its first derivative with regard to z, u’ (2,2), 
= ai(A)u(x,r) — Bi(A) K (a, d) (2,2) 
(2 (i =0,1), 
= dr) + 6: (A) K (2, d)u’ (2,2). 
By the sets of conditions (A), (B), for the equation (1) and the linear combi- 
nations (2) shall be meant the following. 
Conditions (A). 
I. K(2,X) and G(x, X) are continuous* real functions of x and X, for all 
real values of z in the interval 
(X) 
and for all real values of X in the interval 
(A) (£,<r < 


II. K (2, X) is always positive in (X, A). 
III. Throughout (A), the eight coefficients a;, --- , 6; of (2) are continuous 
real functions of \, and 


Jas] + >0, + > 0. 


IV. (1) K and G always decrease (or at least do not increase) as \ increases. 

(11) Either 6; = 0, or 8; + 0, in which case the quotient a;/8; decreases 
(or at least does not increase) as X increases. Also either 6; = 0, or 6; + 0, 
in which case the quotient y;/5; decreases (or at least does not increase) as 
increases. 

V. For any arbitrary \, there shall exist an x for which K or G actually 
decreases as X increases, unless for this value 8; + 0 (or 6; + 0) and a;/B; 
(or y;:/6;) actually decreases for this value of \ as \ increases. 


Conditions (B). 

Such further conditions on the coefficients of the system as will ensure the 
correctness of Sturm’s Theorem of Oscillation for the system (1), (3). Various 
sets of conditions have been worked out by Professor Bécher in Chapter III, 

~ * The existence and continuity of 0K / dx is commonly required, but this is not necessary. 
See Weyl, Mathematische Annalen, vol. 68 (1910), p. 221 and Bécher, these 


Transactions, vol. 14 (1913), pp. 412-418. 
t In particular, £, and £L, may be — ~ or + © respectively. 
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paragraphs 13-15 of his recent book, Lecons sur les Méthodes de Sturm (Paris, 
1917). 


Consider now the sturmian system consisting of the equation (1) and 
conditions 


In{u(a)] =0, 


©) Mo[u(a)] =0. 


If u(2, X) is a solution of (1), (different from zero), satisfying the first bound- 
ary condition of (3) and if we have either 

Yo Yo 71 

5 < 3, or 30 > 5,” 
if Yo 5: — 50 ¥1 is not zero in (A), and if the expressions involved satisfy (A), 
then it follows from Sturm’s Oscillation Theorem that 


Mo[u(b)] and M,[u(b)] 


change sign when they vanish and their roots separate one another, and also 
that the roots of w(a, \) decrease as increases. * 

Concerning the sturmian system (1), (3) we have the following results,t 
provided (A) and (B) are satisfied: 

I. There exist an infinite number of characteristic values{ for the system 
in (A), having a single cluster point at £2. 

II. If we arrange these values Ao, A1, «~~ in order of increasing magnitude 
and denote the corresponding characteristic functions by Up (a2), Ui(2),---, 
then U,,(x) will vanish precisely n times on a < x < b. 

III. If w(2,X) be the particular solution satisfying 


In{u(a)] = 0 
for which 
u(a,r) = BoA), K(a,r)u’(a,r) = ao(A), 


then forz >a 


lim = &, limu(z,A) = 
A=% 


lion (asd) _ 
u(x,r) 


* Loc. cit., pp. 139, 143. 
t See Bécher, Encyklopddie der mathematischen Wissenschaften, II A, 7a. 
tA value ; of \ is said to be a characteristic value of a system such as (1), (3), if when 
\ = Xi, this system has a solution not identically zero. Any such solution is termed a char- 
acteristic function. 
Trans. Am, Math. Soc. 6 
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We may notice that a very important special case of a system satisfying 
(A) and (B) is formed by the equation 


d 
| — [l(x) =0 


together with boundary conditions in which the coefficients are constants, 
provided k(x) >0,g(x) >0. The interval (A)is(—- ©, +0). 

If however g changes sign, and 1 = 0, ao By = 0, Yo do = 0, the equation 
may be put in the form 


d[ k(x) du I(x) 
If the parameter \ be replaced by the new parameter \ = |A| the condi- 


tions (A) and (B) are satisfied for the interval (0, + ©) for(A). By this 
device, due to Bécher,* the case in which g (2) changes sign can be dealt with. 


2. THE GENERALIZED SELF-ADJOINT BOUNDARY PROBLEM. EXISTENCE OF AN 
INFINITE SET OF CHARACTERISTIC VALUES 


Consider the system consisting of the differential equation (1) together with 
the boundary conditions 
Lo[u(a)] = Mo[u(b)], 


(4) 
L,{u(a)] = Mi[u(b)], 


where L; and M; are defined by (2). Throughout this paper we shall suppose 
that the system (1), (4) satisfies the set of conditions (A), $1, and that 
the conditions (4) do not reduce to the sturmian type for any \, and that 
these conditions are self-adjoint 


ag Bi — Boor = — — bo 41). 


Now the quantity ao 8; — Bo a: is never zero in this case, since the condi- 
tions (4) are then reducible by linear combination to the sturmian type. By 
division of the first equation (4) by this quantity, we may, without loss of 
generality or effect upon (4), (B), $1, take 


(5) ao Bi — Boor = — — = — 1. 


We shall consider only real solutions of (1), (4). 

Let uo(x,) and u,(x,2) denote the two linearly independent solutions 
of (2) satisfying the conditions 
 * Cf. Bocher, Proceedings of The Fifth International Congress of Mathematicians, Cambridge, 
England, vol. 1 (1912), p. 173. 
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© Io{uo(a)}=0, Lnluo(a)] =1, 
In{wui(a)] = 1, I, [u(a)] = 0. 

Solving the first two equations, we have 

(7) =ao(X),  wol(a,r) = Bo(A); 

and solving the second pair, 

(8) K(a,d)uj(a,4) = —ai(dA), = — Bi(A). 


By (A), ao and Bo may not vanish together, so that (7) and (8) also deter- 
mine wp and wu; as functions of x, linearly independent for every value of \. 
Abel’s formula for (1) gives 


1 
(9) Uo — = — 


Also direct computation and simplification by (6) and (9) show that 
Lo[ uo] Li [v1] — = —1, 
Mo[ uo] Mi Mo[ My [ 


THEOREM. A necessary and sufficient condition that there exist a solution u 
not identically zero satisfying (4) when X = lis that ¢(1) = 0, where 
(11) = —2+ + 
Proof. The general solution of (1) is 
u(x,rX) = um 


A simple reduction by means of (6) shows that a necessary and sufficient con- 
dition that u(2,) not identically zero satisfy (4) is that co and c¢; satisfy 


cof — Mo[wo(b)]} +e1{1 — Mo[m(b)]} =0, 
— Mi[wo(b)]} Milw(b)j} =90, 
and are not both zero; this is possible if and only if 


| = Molwo(b)] 1 — Molm(b)] 
90%) _ — Mila (b)] 


By (10) this reduces to (11). 

The equation ¢(A) = 0 is the characteristic equation, that is, an equation 
which has for its roots all the characteristic values and no other roots. We 
may note that the quantities co and c; are determined uniquely, except for a 
constant multiplier, provided not all the elements of the determinant (12) 
vanish. A value, 1, for which ¢(1) = 0 but not all the elements of (12) 
vanish, is said to be a simple characteristic value. 


= 0. 
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If all the elements of (12) are zero, then cg and c; are both arbitrary and 
there will exist two linearly independent solutions of the system (1), (4).. A 
value \ = /, such that ; 


Mo[uo(b)}=0, =1, 
M,[uo(b)] = 1, M, (6) ] 0, 


is called a double characteristic value.* 
Consider the auxiliary sturmian system 


In[u(a)] = 0, 
Myo[u(b)] =0. 


(13) 


(14) 


We shall assume that this system satisfies conditions (B) of § 1 in addition 
to (A). Let the characteristic values of this system be Xo, A1, «++, ordered 
so that 

No < Ar 


EXISTENCE THEOREM. [If the system (1), (4) satisfies conditions (A), (B), 
there exists at least one characteristic value of the system (1), (4) between any 
two characteristic values of the auxiliary sturmian system (14). 

To prove this we first note that we have already determined wo so as to 
satisfy the first of conditions (14), and the only solution which can be so 
determined differs from wo merely by a constant factor. Consequently, the 
characteristic values of the system (14) are the roots of the equation 


(15) Mo[uo(b,r»)] = 0. 

Hence we have from (10) for A = A;, 
Mo[ui(b, Milw(b,A:)] = 1 

Reducing (11) by means of this last relation, we have 


1 


(16) $(%) = Tug (b, 


M,[uo(b, Ai) 


so that at X = X;, #(A;) has the same sign as M,[wo(b,A;)] except when 
M,[uo(b,r;)] = 1, that is when ¢(A;) = 0. 

We may apply the results of Sturm to the functions Mo [uo(b,)] and 
M,[u(6,X)], so that these functions must change sign when they vanish 
and their roots separate one another.t Equation (15) taken in conjunction 


* Hence at a double characteristic value we have by solving (13), 
K (b,l) uy (b, 1) = 0, uo(b,l) = — do, 
K(b,l)u,(b,l) =—y, w(b,l) =&. 


a 
See § 1. 
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with this fact establishes that if M,[uo(b,)] is positive at Xo, Ae, ---, it is 
negative at \1, As, --* or vice versa. Hence ¢(X) is positive or zero at Xo, 
he, and negative at Ai, As, ---; or else is positive or zero at Ai, 
hs, and negative at Ao, In either case has at least one 
root between A, and A,y1. This proves the theorem and carries with it the 
existence of an infinite set of characteristic values for the system (1), (4) 
under the conditions stated. 


3. SOME PROPERTIES OF THE FUNCTION ¢()) 


In considering the uniqueness of the characteristic values, we shall need 
information concerning the behavior of ¢(X) near simple and double char- 
acteristic values. 

We now introduce the further condition 


ao Yo 
a1 71 
, 
ay Yo 
, , 
71 


(C) 


This condition is unaltered by linear combination of the two boundary con- 
ditions. 

TueoreM I’. If (A), (B), (C) are satisfied in the vicinity of } = 1, a 
simple characteristic value, @() changes sign in such wise that dp/dd has the 
same sign as Mo[uo(b,rX)] or — Mi[ui(b,A)], provided that K, G, ao, 
have continuous derivatives with respect to d of the first order, and for \ = I either 
K, or G, is negative for at least one value of xina<x=b. 

TueorEM II’. Jf (A), (B), (C) are satisfied in the vicinity of X = 1, a 
double characteristic value, @() preserves a negative sign, while Mo[uo(b, dr)] 
and — M,[u(b,)] change from positive to negative as d increases through l, 
provided that K, G, ao, «++ 5; have continuous derivatives with regard to X of the 
first two orders, and for \ = | either K, or G, is negative for at least one value 
ofxina<2xZb. 

Theorem I’ will be proved if it is shown that d¢/dd has the same sign as 
Mo[uo(b,r)] or — Mi[ui(b,r)] at We proceed to compute 
do/d\. For this purpose, it will be convenient to introduce the following 


system of equations: 
2’ =G(2,A)u, 


(17) = 
ai(A)u(a) — Bi(u)2z(a) = yi(A)u(d) + (H)z(b) (i =0,1), 


H(x,u) =1/K(2,p). 


where 


| bo 

4 | | 1! <0. 

5o 

| 
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If we eliminate z from the equations (17) we obtain 


L;{u(a)] = M;[u(b)] (¢ =0,1), 


where by Io, L;, Mo, M, are denoted the same expressions as in (2) save that 
Bo(A), Bi(A), 61(A) are replaced by Bo(u), Bi(u), do(m), 5: (u) 
respectively. Evidently if we write \ = uw, the equations (17’) become pre- 
cisely (1), (4). 

If now we define uwo(2;X, mu), u1(2;A,m) as the solutions of the first 
equation (17’) satisfying modified conditions like (7), (8), 


K(a,) uy (a;X, w) = ao(d), uo (a;r, = Bo(n), 
K(a,d)ui(a;A,4) =—a(d), = 


and if, as in (11), we define 
= + — 2, 
then we have clearly 


dd 


in consequence of the obvious identity ¢(A) = @(A,A). 
If, moreover, we write 


Or Ou 


zo(@;A,u) = = K(2, mu) uj (25d, 4), 


it is seen that (9, 2) and (2, z,;) may be defined as the solutions of the 
first two equations (17) which fulfil the further conditions 


zo(a;A,m) = ao(A), uo(a;A,u) = 
We may now give ¢(A, uw) the form: 
= (A) (O52, + 41 20(55 A, w) 
+ yo(A) (653A, w) + — 2. 


By virtue of the symmetry in wu and z on the one hand, and in \ and yp on the 
other, in these equations and in (17), it will be necessary to compute merely 
0¢/0X to obtain 0¢/du. For then replacing \ by yw, G by H, a; by — B;, 
vi by 6;, ui by 2; + 7), we shall have 0¢/dy. 

It is worth noting that the equations (5) may be used in simplifying d¢/dX 


q 
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for \ = w, since these remain unaltered if a; be replaced by — 8;, yi by 4;, 
B; by = Gis and 6; by Yi- 
Now we have 
Ou(b) dz (b) (6b) 
+ you (b) + yi uo(b), 
where the accent indicates differentiation with respect to \, and the argu- 


ments A, w are omitted in wo, Zo, U1, 21, Yo, Y1» 50, 61. If we differentiate the 
first two equations (17) for w = uo with regard to A, we obtain 


d Ou 


We may also notice that duo/dX satisfies the initial conditions 


Oug(a) (a) 


= 0, = K(a, x uo (a) = a 


The equation (19) is a linear, non-homogeneous equation of the second order. 
If the right hand side is replaced by zero, wo and u; are two linearly independent 
solutions of the resulting homogeneous equation. Lagrange’s Method of 
Variation of Parameters yields as the general solution 


Or — K(E,m) 


+ Co(A)uo(z) + Ci (A) um (2x), 


which, after simplifying by (9) and solving for C; to satisfy the initial condi- 
tions, becomes 


— B, a Uo (x) — Boa u(x). 


Also by differentiating as to x and multiplying by K (x, »), we find 


Oz 


— Bi a — Bo a 21 (2). 
In exactly similar fashion, we determine 


+ Bi a + Boaim(x), 
aG (x, 


_ 9K (x, 
Ar 


*G, 


Ku 
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| 


+ Bi a 20(%) + Boa 2 (2). 
By means of (18), (20), (21), and (22) we may write 


=A 


+ {Mo[uo(b)] Bia; + Mo[ui(b)] Bo a: 
— M,[uo(b)] Bi a — Bo a}. 


We may write out 0¢/dyu from symmetry, and combining for » = A with 
0¢/0X, we obtain 


f Fiu(e), uo (E)]G, dé 


where 
F[s,t] = Mo[uo(b)]s8? + {Mi[uo(b)] Mo[ui (6) — Mi[u(b)]#, 


and 


R= Mo[uo(b)] (a: Bi — Bi) + [uo (ay By — fr a ) 
+ (b)] (a; Bo — = M, (b)] (a Bo — By ) 
+ uo (b) + 8) 20(b) + (b) + (b). 


If ap --- 6; are all constants, then R = 0, and d@/dd reduces to the sum 
of two integrals, each of which has an integrand whose first factor is a quad- 
ratic form. The common discriminant of these forms is 


D = {M,[wo(b)] — Mo[ui(b)]}? + 4Mo[ wo (6)]. 
From (10) we have 
Mo[uo(b)] Milwui(b)] = —1+ Mo[u(b)] Miluo(b)] 
D = {Mo{u(b)] + Miluo(b)}}? 4. 


The discriminant breaks up into two factors, one of which is precisely ¢ (\) 
by (11). Hence at \ = / the discriminant vanishes. 
Now at a simple value, not all the coefficients of the quadratic forms can 


9 
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vanish (see (10) and (13)); furthermore, wp and w are linearly independent. 
Hence the first factors of the integrands will vanish only at isolated points of 
the interval a < 2=b for But either G, or K, is negative for at 
least one value of z in a < x =}, and zero when not negative. Hence the 
sign of d¢/dd is that of M,[ui(b)] or — Mo[uo(b)] when ao --- 6; are 
constants. 

Now suppose that ap --- 6; are functions of X. Let us determine under 
what conditions R will have the same sign as that of the two integrals of (24). 
A necessary condition that the last line of R be expressible in the form 


Co Mo[uo] + Do Miluo] + Di + Ci Milus] 
is that 
= Covot Don, 
5; = + 
=Dywt+Cin, 


5) = dD, 5o + Cy 
whence solving we have 


Co= 7115-151, Cy = ¥o 8) — Yo So, 
Do = — 60, Di = Yob1 — 1155. 


Since Do = — D, by (5), and since the coefficients of My[u;(b)] and 
M, [uo (b)] in the first two lines of R are equal for the same reason, we have 


R = AM, + — Mil[uo]}— CMo[uo], 


2d ( a 2d ( 

a= 

B = (ay Bi — a1 By) + (yo — 55), 

d 

e= — 

) 

The coefficients & and © are positive or zero in consequence of (A), § 1. 
But we have 


where 


Mo[u] + Mi[uo] = 2 
at\ =71. Combining with (10) we find 


Mo[u] — Miluo] = + 2V— Mo[uo] Mi[m]. 


Let uw? = + M, [wu] and v = F Mo[u0]; then it is clear that 


R= + + Cr’). 


‘ 
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The necessary and sufficient condition that this quadratic form be semi- 


definite is that 
ace 
which may be written 


ay Bo — Boa + yo 50 Yo ao Bi — Boa + — 


( ) a; By — Bi ay +7186) — bi yo — Bia +7146 — 


= 
since differentiation of (5) shows that the second element of the first row of 
this determinant is equal to &. 

The determinant in (D’) may be written in the form 


|| ao Bo 5o| ao Bo| 5o 
» Bol jar Bil & 
By v1 6; | Pr) 6; 
Bo} |vo jar Bil 
Bo| Bi Bo| a Pi Yo 
Bo| jer Bi 1 Biller Bil 
a 41 _ Bi 


But by expanding by Laplace’s development we have 


a, Bi a Bolla: Bi 


ay Bi | ao aX Bo ao Bo ay Bi 
0 


a Bo Bi 
By 


| a0 Bo 
| 


ao Bo! | 


a) Bi | ao Bo 
But since the system is self-adjoint, 
Yo Bo _ Bi _ | Bo ay 
billet Billa; Biles las Bal 
5, 


Also 
Hence 
lao Bol|vo , jar 
Billa Ally lo; 


a | a Bolly 41 a Billyvo 


a, Billyi Billys las Belly, 


= 0. 


5; 
| 
ao Bo 
8; | ° 
5o| 
5, | 
| 
| 
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But this is the Laplace development by the first two columns of the determinant 


Yo 

71 

which reduces (D’) to (C). 

Hence if (C) is satisfied, R , and also , has the same sign as M, [u, (b)] 
or — Mo[uo(b)]atX =1. IfX = lisa simple characteristic value, it follows 
that ¢ (\) changes sign at\ = /, having for \ > / the same sign as M,[ u (6)] 
or — Mo[uo(b)], and for \ <1 the opposite sign, at least if the functions 
K, G, a, «++, 6; have continuous derivatives with regard to A}. Thus 
Theorem I’ is proved. 

The first part of Theorem II’ will be proved if we show that d* ¢/d)* is 
negative at a double value. The formula obtained above for d¢/d\ shows 
that d¢/dd vanishes at a double value \ = / in consequence of equations (13). 
Also, if we differentiate the second identity (10) twice as to \, we obtain at 
such a value 


My [ U1 ] M, [ uo] OM, [ Ug | [ ur] [ Uy ] 
On On or On on 


But the left-hand member of this equation is precisely d? ¢/d\? while the 
right-hand member can be evaluated explicitly by use of (20), (21), (22), (23). 
Hence at such a double value we have 


—f(E (g(a) (a) 
— 2f(E)g(E)fi(n) |dédn + R, 
f=wv-G,, 
g=mV-G,, 


R= (ae 8) (By + Fo) (— f — ff 


awe), 


E; = a Bi — a: Bi, F; = 8; — 


Now by (A), $1, the quantities G,, K,, E;, F; are negative or zero, so 
that R is negative or zero. 


N 
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It remains to determine the sign of the integral terms in d’?¢/d\?. By 
interchanging the variables of integration in the proper manner, these terms 
take the form 


-3f f (a) — fila) 
+ 2(f(&) (9) — fila) 


which is negative, since either f and g or f; and g; are linearly independent. 
It is obvious, therefore, that d? ¢/dd? is negative at a double characteristic 
value. Hence ¢() preserves a negative sign at a double value. 
Finally, we observe that a direct computation of dMo[wo(b,)]/dd and 
dM, [u:(b,)]/dd at such a double value, /, using (13), (20), (21), (22), (23) 
yields for \ = I, 


dM. b,x 
+ ff (Bo + Fo), 


These expressions are positive and negative, respectively, which proves the 
second part of Theorem II’. 
We proceed to state and prove the above theorems in a more general form, 


after introducing the condition 


(C) 


Aa, AB, Ay 


TueorEeM I. If (A), (B), (C) are satisfied in the vicinity of \ = 1, a 
simple characteristic value, @(X) changes sign in such wise that 


o(1+Ad) Ad 
Ay 


has the same sign as My[uo(b,r)] or — My [wm (6,A)]. 

TueoreEM II. Jf (A), (B), (C) are satisfied in the vicinity of X =|, a 
double characteristic value, preserves a negative sign, while Mg [uo(b, dr) 
and — M, ju, (b,2)] change from positive to negative as X increases through |. 

Suppose that Theorem I did not hold for the case where the derivatives 
fail to exist. For instance, suppose that ¢(A) did not have the specified 


sign ath =1+AA>1. 


| 
i] 
Qo Bo Yo 
a1 Bi Yi 
=0. 
= Aco AB Ayo Ado 
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Consider modified values of K, G, ap --- 6; as follows: 


K(2,d) = K(z,1) + ad) K(2, 01, 


G(x,d) = G(2,l) Ad) — G(z,1)], 


a! 


a@o(X) = ao(l) -+ — 


(1+ Ad) — 6 


an 


(4) = +> 


The system with the modified coefficients will satisfy all the conditions as to 
derivatives and will coincide with the given system for \ = / and for \ = l 
+ Ar. Let i, %, denote the two linearly independent solutions of the 
system (2), (6), whose coefficients are defined by (25). Let o(d) = 0 be the 
characteristic equation for this system. Now it is clear that ¢(1) = (1) = 0 
and ¢(1+ Ad) = ¢(1+ Ad). The above conclusion holds for ¢(A), so 
that at \=1, @ ‘changes sign and for \ > 1, (A) has the same sign as 
M,[a%(b)] or — Mo[io(b)]. By however, does not 
have this sign at X =1+ Ad. Hence #(X) must vanish in the interval 
(1,1 + Ad) for Ad arbitrarily small, i. e., ¢(l’) = 0, where] <I’ <1+ AX. 
But by Theorem I, for two successive values for which ¢(X) = 0, Mo[io(b)] 
and [a (b)] must vanish in (J, I’): 


=0, 


We infer that 
Mo[uo(b,1)] =0, =0, 


since My [io(b, and )] are continuous functionals of K, G, 
Qo, ‘++, 61 and since for AX = 0, these reduce to K, G, ao, «++, 61 respec- 
tively.* 

Combining these equations with the equation ¢(/) = 0 and (10), we find 


further that 
M,[uo(6,1)] =1, =1, 


i. e., \ = Lis a double characteristic value contrary to hypothesis. 

For \ < /, a similar discussion may be made. 

In order to prove Theorem II, consider the system with the modified coef- 
ficients (25), which satisfies the condition as to derivatives required by The- 
orem II’. The values of K, G, ao, ---, 51, coincide with those of K, G, 
respectively at the double =landatAX =1+ AX. 


* Cf. ‘Bocher, these Transactions, vol. 3 (1902), p. 208. 
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Let ¢(X) = 0 denote the corresponding characteristic equation so that 


o(l)=¢(l), 4) = (1+ Ad). 


If (A) does not preserve a negative sign in the vicinity of 1, we may assume 
that @ (1 + Ad) is positive or zero. Hence ¢(X) must vanish for /’ inasmuch 
as \ = lis a double value for (25) and d? ¢/dd? is negative at} =1. At the 
first value I’ of > 1) for which vanishes, it is clear that = 0 
since $(d) is negative in the vicinity of 1. Therefore, as we have seen, 
Mi [i (b,)] is negative for \ greater than but nearly equal to /. Hence 
this function vanishes between / and l’. This leads to a contradiction for AX 
sufficiently small just as in the proof of Theorem I. 

Likewise if Mo [uo(b, X)] or M, [ui (b,2)] have not the stated sign near a 
double value of \, we consider My [dio (b,X)] or M, and prove 
that these functions vanish between / and / + AX where Ad is arbitrarily 
small. Then, by allowing AX to approach zero, we are led to a contradiction. 

Thus Theorems I and II are established. 


4. ON CONDITIONS SUFFICIENT FOR UNIQUENESS 


We are now ready to state a theorem concerning the uniqueness of the 
characteristic values. 

THEOREM CONCERNING UNIQUENESS. For a system (1), (4), satisfying (A), 
(B), (C), there exists one and only one characteristic value between every pair 
of characteristic numbers of the auxiliary sturmian system (1), (3). If Xo, rx, 
Ae, +++ are the ordered characteristic numbers of this system (1), (3), and lo, l,, 
l,, +++ are the ordered characteristic numbers of the system (1), (4) (account 
being taken of their multiplicity), the following cases are possible: 


Sl <u <h Sh <3 < G, 
I,: <-:-- &, 
The conditions for these cases are respectively: 
et M,[uo(b,ro)] > 0, o(L£:+6€)>0, 
I,: M,[uo(b,ro)] > 0, o(£,+6€) <0, 
Il.: M,[uo(b,ro)] < 0, o(£,;+6€)>0, 
IL: < 0, o(£,+6€) <0. 
Proof. Case I. If M,[uo(b,X>)] > 0, then by (16), ¢ is positive or zero 
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at Ao, Ac, As, «++ and @ is negative at \,,A3, ---. Hence there exist at least 
two values (/;, in each double interval (Azp_1, such that 


Nop-1 < S Aap S < 


If there were additional values /; in a double interval, there would be at 
least two more, or four in all, since @(Aep-1) and @(A2p41) have the same 
sign. If we suppose that there is no double value, then at least two simple 
values must fall within (A2p_-1, Or (Agp, and at two successive such 
values A¢/AX has opposite signs. However, by Theorem I of § 3, Ad/AX 
would have the same sign as — Mo[uo(b,)] at both of these values. This 
is impossible, since the sign of Mo [wo (6, X)] does not change in this interval. 
Suppose now that a double value exists, necessarily for \ = Ae, by (13). 
Since (Aep-1) and ¢ are both negative and ¢(X) is negative near 
by Theorem II, § 3, there will be, if there are other roots in (Az2p-1, Azpi1), 
two simple roots lying in one of the intervals (Azp_-1, ) Or (Azp, But 
this has already been proved impossible. 

To complete the discussion of Case I, it needs only to be shown that if 
o(£; + €) > 0, we are led to a single value Jp of / in the interval (-£;, 1) 
such that Ao S lp S Ax, while if 6(£; + €) < 0, we are led to two roots lp, 1, 
such that £; < For then we have Case I, and Case I, 
respectively. 

When ¢(-£; + €) > 0, suppose first that (Ao) > 0. There is one and 
only one root in (Xo, A1) since Af@/AX has one and the same sign at all such 
roots which are simple. And there is no root in (-£,, \o), for if there were 
one root, there would necessarily be two which is clearly impossible. Thus 
we have Case I,. 

The possibility ¢ (Ao) = 0 may be excluded. For in this case \» would be a 
double value by (10), (11), and (16). Also ¢(X) is negative near A» by The- 
orem II, §3. Hence there must be a simple characteristic value for \ < Xo 
at which A@/Ad is negative. Consequently by Theorem I, § 3, Mo [wo(6, A) ] 
is negative for this value. However, by Theorem II, §3, Mo[wo(b,X)] is 
positive near Xo for A < Ao, so that Mo[wo(b,)] changes sign for X < Ao, 
which is impossible. 

When ¢(-£; + €) <0, we may apply precisely the same reasoning to the 
interval (£,, 1), as we have done to (Azp-1, Azp41) to prove that there are 
two roots I), /; restricted as in Case I,. 

Case II. If Mi[uo(b,)] < 0, then by (16), ¢ is negative at Xo, Ae, - 
and ¢ is positive or zero at \1, A3, -:-. It follows exactly as in Case I, that 
there exist two values l’, l’’, in each double interval (A2p, Azp42) such that 
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If 6(£; + €) <0, we have Case II,, since there are no characteristic values 
in (£,, Xo). If 6(£:+¢€) >0, we have only the possibility of a single 
characteristic value lo < Xo, and Case II, arises. 

On the basis of the foregoing Uniqueness Theorem, we can at once state the 
following Oscillation Theorem in the form of a corollary. 

Corotuary. [If the system (1), (4) satisfies (A), (B), (C), the characteristic 
function uy (x) belonging to the pth characteristic value will vanish p — 2,p — 1, 
P,pt+1l,orp+2 timesona<2a<b. 

Proof. For the system (1), (14), the auxilidry sturmian characteristic 
values, A,, are given by 

Mo[uo(b,r)] = 0. 


The Uniqueness Theorem states that l, lies on the interval (Ap-1, Apy1)- 
Now by Sturm’s Oscillation Theorem for the system: (1), (14), wo(2, Ap) 
vanishes exactly p times on a < x <b and so uwo(2,X) will vanish p — 1, 
p, or p +1 times on a < < b for SA S Apts, Or Uo (2, will have 
p — 1, p, or p+ 1 zeros on this same interval of the z-axis. But the zeros 
of u,(x) and uo(2, l,) separate one another or else coincide, so that u, (x) 
will have on a < x < beither p —2,p —1,p,p+1, or p + 2 zeros. 


AusTIN, TEXAS 


ON BOUNDARY VALUE PROBLEMS IN LINEAR DIFFERENTIAL 
EQUATIONS IN GENERAL ANALYSIS* 


BY 
T. H. HILDEBRANDT 


In a paper On a theory of linear differential equations in general analysis,t 
we considered the solution of the general linear differential equations 


(A) M,(n) = Dn — a — Jam = 0,7 
(B) M:z() = Dn — Jan = 0, 

(C) M2() — ao = Dyn — Jan — a = 0, 
and their adjoints 

(A’) Ni(4) = Dg + = 0, 
(B’) N2(4) = Di + Jia = 0, 

(C’) — a = D5 + Jia — ap = O. 


In the case of equations (A), (B), and (C) we found{ that the general solution 
of class §’ is expressible in the form 


+m, 


in which 7 is a particular solution of the equation in question, 7 is a solu- 
tion of equation (A) whose Fredholm determinant is not zero, and x is any 
function of the class &. Similarly, the general solution of equations (A’), 
(B’), and (C’) has the form 
= « + JSkijo + ii. 

We found further that there is a unique solution of each of these equations 
which satisfies an initial condition of the form 

= ko, 
where x; is any element of X, and xo any function of &. 

* Presented to the Society, April 6, 1917. 

+ These Transactions, vol. 18 (1917), pp. 73-96. This paper will be referred to 
as I in the sequel. With the exception of an additional condition on J in §§ 2 and 3, the 
postulates and properties of the classes Mt’, M’’, H, H’, and &K, and the functions a, q, 
and «x, and the operators J and D in the present paper are the same as in I. 

t Cf. I, loc. cit., pp. 84, 86, 87. 
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In Section 1 of this paper we consider the solutions of the equations (A), 
(B), and (C), and (A’), (B’), and (C’), whose values at two or more elements 
of X satisfy a linear relation, i. e., a linear boundary condition. Section 2 is 
devoted to the definition of adjoint systems of boundary conditions. Section 3 
derives the usual theorems concerning the interrelations of solutions of adjoint 
systems. They are in the main similar to those derived in the first three 
sections of the paper by Bécher on Applications and generalizations of the 
concept of adjoint systems* and include them as special cases. We have not 
given particular instances of the theory developed in the following pages. It 
is an easy matter to construct some of the more important ones along the 
lines outlined in §§ 12-14 of I.—| We might remark, however, that if we let 
Y= PS" =(0Spstjand =M” = the class of all continuous functions 
on the interval (0, 1) and the operator J be the definite integral /,'dp, we 
have below a theory of linear boundary value problems for linear integro- 
differential equations. 

1. Boundary conditions. The boundary conditions which we consider are 
linear in the values of the function 7 at two points 2; and 22 of the class X. 
The relations are further chosen so that the substitution of the general solution 
of the differential equations (4), (B), or (C) reduces the determination of 
the function «x to the solution of a linear general integral equation of the 
second or Fredholm type, i. e., one to which the Fredholm-Mooret theory is 


applicable. Such boundary conditions have the form§ 


S() = en(a1) + Jor n (a1) + dn (a2) + Joon (ae) = 0, 


S (7) = do; 


in which o;, o2, and oo are functions of the class @, and ¢ and d are con- 
stants for which ¢ +d +0. By dividing by ¢ + d, we get equivalent con- 
ditions with e + d= 1. We assume in the sequel that ¢ and d satisfy this 
relation. In so far as the expression S (7) + 01 + @¢ is of frequent occurrence, 
we denote it by So(m). 

In a similar way, we consider relative to the adjoint differential equations 
(A’), (B’), and (C’) boundary conditions of the form: 


T (4) = cf (a1) + JH (a1) 71 + di (ae) + JH (a2) 72 = 0, 


T ( ) 

*These Transactions, vol. 14 (1913), pp. 403-420. 

t Loe. cit., pp. 87-96. 

t{Cf. E. H. Moore: Bulletin of the American Mathematical Soci- 
ety, vol. 18 (1912), pp. 351-361. 

§ If the classes B®’ = $B” = [1,2,---,n]andJ = 3}, these conditions reduce to the bound- 
ary conditions considered by Bounitzky: Journal de Mathematiques, ser. 6, 
vol. 5 (1909), p. 68. 
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where 71, 72, and ty are functions of the class R, ande +d=1. We set 
To (4) T (4) + 71 + 


The following propositions are easily obtained by making the required 
substitutions and rearranging the terms properly. 


S(m + 2) = S(m) + 8(m), 
S(Jnx) = JS(n)k, 
S(« + =x +JSS8o(n)k, 
(4) So(m + m2) = So(m) + S(m), 
(5) So(xk ++ +S8o(n) +IS8o(n)k, 
(1’) T(t + G2) = + Tie), 
(2’) (Ji) = (4), 
(3’) T(x + Jj) = x + 
(4’) To (41 + = To(t1) + T (he), 
To(k +4 + = « + To(H) + 


These propositions immediately give rise to the following theorems relating 
to the systems 


(1) N = Dj+a+t+ Jia, 
So(7) = (0, 


M2(n) = Dn — Jan = 0, 
S() 0, 


Ms(n) = a, 


TueorEM I. A necessary and sufficient condition that the system (1) [(1’)] 
has a solution is that the Fredholm determinant of So (no) [ To (aio) be different 
from zero, no [fo] being a solution of equation (A) [(A’)] whose Fredholm deter- 
minant is not zero. 

For if no is a particular solution of M, (7) = 0, whose Fredholm determinant 
is not zero, the general solution can be written 


n=k+n0+Jnk, 
and so x must satisfy the equation 


So(n) = « + So(mo) + JSo(m0)« = 0. 


N2(4) = Di + Jia = 0, 
9 9 
No (4) = ao; 
3’ 
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This has the form of a reciprocal relation. It has a solution and one only 
if the Fredholm determinant of So (1) is not zero.* 

If S~* (mo) is the reciprocal of So (mo), then we have the 

Corotiary. If a solution n of class ’ of the system (1) exists, it has the 
form 


n = + no + Jno S* (m0). 


Denoting, for convenience, by mo [fo] a solution of equation (A) [(A’)] 
whose Fredholm determinant is not zero, we get in a similar way by using 
Propositions (3) [(3’)]: 

THEOREM II. A necessary and sufficient condition that there exist a solution 
of the system (2) [(2’)] which is not identically zero, is that the Fredholm deter- 
minant of So(no) [ To(fo)] be zero. If ui, ] are a com- 
plete set of linearly independent solutions of 


= 0, + Ju” = 9), 


then the general solution of the system (2) [(2’)] can be written 


n= (bn + bin) Mm 5 = bin + iv) |, 
where uw, [u,,] are any n functions of the class M’’ [M’]. 

In such a case, the system (2) [(2’)] is said to have n-fold compatibility. 

Using propositions (1) and (3) [(1’) and (3’)], and denoting by m [7:1] a 
particular solution of equation (C) [(C’)], we have 

TuHeoreM III. The system (3) [(3’)] has a unique solution if the Fredholm 
determinant of So(no) To (to) ts not zero. If this determinant is zero, then a 
necessary and sufficient condition for the existence of a solution of this system is 
that 


Jx(S(m) — oo) = 0, [J — = 0), 
for every solution x of the homogeneous equation 
+ JkSo(mo) = 0, [xk + JTo(fo)x = 0]. 


A similar set of propositions and theorems can be derived if the boundary 
conditions are linear relations in the values of the solution of the differential 
equations at n points of ¥: 2, ---,2,. For the equation (B) the conditions 
take the form 

S(n) = ( (em (am) + (2m)) = 0, =o, 
where 1, and are any functions of and ¢, ---, a set of con- 
stants for which 
+e =-1. 
* Cf. E. H. Moore: loc. cit., p. 354. 
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2. On adjoint systems. Before taking up the definition of adjoint systems, 
it will be necessary to add a further postulate on the operator J. In this and 
the succeeding paragraphs, we shall assume that J has the following property: 


If Jui uw’ = 0 for every p’ of the class Qt’, then uw, = 0, 
and 
if Ju’’ wo = O for every uw” of the class I’, then po = 0. 


This property corresponds to the definite property P») of Moore* when 
= 8" =$8, and M’ = = Mt. We shall therefore call it the definite 
property Py. It is equivalent to the following property, which is the form in 
which we shall have occasion to apply it. 

If Jxxo = 0 or Jxox = O for every x of the class R, then x = 0. 

We observe that J has this definite property in the finite case, and also the 
instances (a), (b), (c) but not (d) of I.t 


Consider now the system 
M2(n) = Dyn — Jan, 
(1) 
Si(9) = + (a1) + din(a2) + (22). 


We assume that the expression S; (7) is what we shall call linearly independent, 
i. e., there exists another linear combination of 7 (2,) and n (22) 


So(m) = ¢29 (a1) + Joon (a1) + don (a2) + Jo22 (22), 


with c; d, — c,d, + 0, such that the equations 8,(7) = 0 and S.(n) =0 
have the unique solution 


n(%i)=O0 and = 
We can always determine the constants a and b with b + 0 so that in 
S; = aS; + bS2, 
d; 2 Cis 
i. 
C2 d; = 


Evidently S; and S; will be completely equivalent to S; and S:, and we shall 
assume in the sequel that in the S: chosen, c, and dz have the character of 
ce, and d;. 


* Loc. cit., p. 361. 

Tt Loc. cit., pp. 89-92. : 

tIf = = (1, ---,n), and J = then this condition actually reduces to the 
linear independence of the boundary conditions S;. A discussion of the above definition for 
the case of linear integral expressions by L. J. Rouse will appear shortly. 
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On account of the assumption relative to S,; and S2, it is possible to solve 
the equations 


(a1) + Jou + + Jow (ae) = Si, 
+ Jor (a1) + + Jo22 (22) = So, 


for 7 (2,) and 7 (22) in terms of S; and S;. If we substitute in the expression 


Ji (#2) — Ji (21) (21) 


and collect the coefficients of S; and S2, these coefficients will be linear ex- 
pressions in 4 (2,) and 4 (22) of the form 


+ (a1) + di + JH (a2) = 
C2 (a1) + (a1) 721 + + SH (a2) 722 = 
We therefore have the identity 
Ji (22) — Ji (a1) 9 (a1) = JT1 (4) S2(n) — JT2(4)Si(n), 


and we note that on account of the condition J’*, the 7;(4) and 72 (4) will 
be uniquely defined, if S; and are given as functions of (2;) and (22). 

We call the expression N2(4) = Di + Jia together with the 7:(%4) so 
defined an adjoint system of system (1).* 

In order to obtain the relations between the constants and functions in 
S,, S. and 7,, T2, we can proceed as outlined in the definition. The same 
relations result however if we equate the coefficients in the defining identity. 
This yields 

= —1, di — dz cy 
dg — c,d, = 0, d, — dz 
from which we conclude that 
= dy, = = de, 
We find further 
dy — dg oy, + C2 — C1 T21 + J (711021 — T2101) = 0, 


dy O22 — dz + — dh + (711 O22 — T2102) = 


* This definition is a generalization of the definition due to Birkhoff, these Transac- 
tions, vol. 9 (1908), p. 173. Bounitzky (loc. cit., p. 73) gives a definition of adjoint which 
may be generalized as follows: S; (7) and 7; (4) are adjoint if for every n (2x1), 7 (22), 
9 (x1), 4 (22) for which S; (7) = O and 7, (4) = 0, we have 
(22) (42) — = 0. 

If S; and 7’; are adjoint according to our definition they will also be according to the Bounitzky 
definition, and conversely on account of J?0. The Bounitzky definition does not however 
seem to lend itself so readily to the derivation of the results of this section and the next. 
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C1 O21 — C2 011 + C2 T12 — C1 T22 (T1202 — T2011) = 0, 
C1 O22 — C2 + dz T12 — T22 + J (712 G22 — 722012) = 0. 
These equalities express the fact that the system of kernels 
C1 T21 — CoTi, d, — 711, 
C2 T12 — C1 dy — dy T22, 
are the reciprocals of the system of kernels 
— dion, dz — 
C1 921 — C2911; C1 — Co O12. 
If we add the last four equations, we get the following symmetrical relation 
O22 +62 +71 + T12 + (tu + Ti2) (G22 + G21) 
= 611 + O12 + T21 + T22 + (721 + (O11 + O12). 
With the aid of these relations we verify without much difficulty that 
(4) + (7) + JT 01 (4) Soz (1) — To2 (4) — Sor (7) 
— JT (4) Sor (n) = (a2) + + (22) (a2) 


— — n(a1) — (21), 
where 


Ta (4) = T1(4) +71 +712, (4) = T2(4) + 721 + 
So(n) = Si(n) ton Sor (7) = S2(n) +021 + 622. 


We therefore have 
TuHeorEeM I. IJ[f'S;(n) and 7T,(4) are adjoint, i. e., if we have identically 


Jij (x2) (22) — (a1) = JT, — 81, 
then we also have identically 
(a2) + + Ji (a2) 9 (42) — — — (21) 0 (21) 
= Ty + So + JST — Tor — Sor — JT 02 Sor. 


The same relations between the o and 7 in the light of their reciprocal 
character give 
TueoreM II. If 7, and T2 are adjoint to S; and S2 then from 
7,(4) =0 and T2(4) =0 
it follows uniquely that 


=0 and =0, 


i. €., (4) ts also linearly independent. 
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The element of arbitrariness which enters into the definition of the adjoint 
T, of S; is taken care of in the following 

TueroreM III. If Sz be replaced by any other S; which has the same character 
as S., and if we denote by T; and 7; the corresponding adjoint expressions, then 
there exist functions x, and x, of the class R such that 


T,= 7, 
and 
Tl, 
e., and T; are essentially equivalent. 
For if we solve the relations 
= + Jou (21) +d 9 (a2) + (22), 
2 = + Joon (a1) + (x2) + Jor (22) 
for n(2,) and n (22), we find 
n (a) = dy S; So + J 721 S; Se, 
n (ae ) = So = Si J T22 S; + JT 12 So. 
Substituting these values in 
= (21) + Jon (a1) + 9 (a2) + (22), 
we find 


S; = 8+ J (ke Si +k S:), 


where x; and x2 are functions of the class  , depending on 022, T11, 
T21, and tT22. We therefore have 
= JT: + J ko + Jk; JT; S; 
J ( T; JT; ) Se J ( T: JT, ko) Sy; 
from which on account of the uniqueness of 7; it follows that 
= T; + JT) 

In a similar way we obtain the other relation of the theorem. 

3. On the solutions of adjoint systems. Before taking up the relations 
which exist between solutions of adjoint systems we note the following lemmas. 


Lemma I. If n is a solution of equation (A) and § a solution of the adjoint 
equation (A’), then we have for every set of adjoint expressions S,, S., T,;, and T 


Tu (4) + Soz (7) + JT (4) Sez (7) To2 (4) + Soi (7) + JTo2(4) So 
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For by Theorem IV’ of § 11 of I,* we have for every pair of solutions 7 and 4 
of the adjoint equations (A) and (4’) 
+ + (a2) = + + JH (a1) 


Our lemma is then an immediate consequence of Theorem I of the preceding 
section. 
Lemma II. [If n is any solution of the system (2) [(2')] 


Mz(n)=90, Si(n) =0, [N2(4) = 0, T1(4) = 0], 
then S2(n) [ T2(4)] ts a solution of the equation 
So(n) + (4) S2(9) = 0, [T2(4) + = 0), 


where 4 [] is any solution of equation (A’) [(A)]. 

Let 7 be a solution of system (2). Then if mo is any solution of equation (A), 
no + 7 is also a solution of equation (A). Further by proposition (4) of § 1, 
we have 


Sor (mo + 7) = Sor(mo) + = Sor(no), 
So2 (no + 7) = (no) + S2(7). 


Applying Lemma I to the solutions yo and yo + 7, we have for every solution 7 
of the adjoint equation (A’) 


To (4) + Soe (no) + S2 (7) + JT 01 (4) Soz (m0) + JT (4) S2(n) 
= To2 (4) + Sor (m0) + J To2 (4) Sor (10) 
= T (4) + (no) + (4) So2 (m0) 


Ss (7) + JT (4) = 0. 


The proof for the lemma in the brackets runs parallel. 
We are now in position to derive the following theorems. 
TueorEM I. [f the system (1) 


Mi(n) = 0, Soi(n) = 0 


Hence 


has a unique solution, then the adjoint system (1') 
Mi(4) =90, Tula) =0 


also has a unique solution and conversely. 

Suppose 7 is the unique solution of system (1), and let 7» be any solution of 
equation (A’), whose Fredholm determinant is not zero. Then by Lemma I 
we shall have for 7 and jo 


So2 (7) + (40) + JT 01 (40) (7) = Toe (Ho). 


* Loc. cit., p. 87. 
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If we consider this as an integral equation in Sq (7), it has a unique solution, 
viz., the value of So2(7). Hence the Fredholm determinant of 7’: (40) is 
not zero, which by Theorem I of §1 is a necessary and sufficient condition 
for the existence of a unique solution of the system (1’). 

TuHeorEM II. If the system (2) 


M2(n) = 0, Si(y) =0 
has n-fold compatibility, then the system (2’) 
N2(4) = 0, T:(4) =0 


has also n-fold compatibility, and conversely. 
Let ui, «+, a be a complete system of linearly independent solutions of 
the homogeneous equation 


+ JSoi (mo) = 0, 
no being a solution of equation (A) whose Fredholm determinant is not zero. 
Then by Theorem II of § 1 
n= (im + bn) 


m=1 


u.. being arbitrary, is a general solution of system (2). By Lemma II 


S2(n) = (wn + (10) wn) 


m=1 
will satisfy the equation 
JST = 0, 
i. e., we shall have 


+ (9) + 01 (0) (in + (1) wn) ) = 


m=1 


Since the u,, are arbitrary, we conclude that the equation 
+ JT (fio) = 0, 


fio being a solution of equation (A’) whose Fredholm determinant is not zero, 


has a set of n solutions 
Mn + JSo2 (70) Mm 


These solutions are linearly independent. For suppose they were not. Then 
there would exist n constants ¢, ---, ¢,, such that 


n 


Cm Lin + J Sco (no) Hin 
1 


m=1 


Cm + J Soe (no ) Min ) 


n= 


Let 
n 
Cm Km = p’ 


m=1 
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Then for every uw” the function 
will satisfy both of the conditions 
Si(n) =0, S2(n) =0. 
But from our assumption relative to S, and S, it follows that 
=0, =0, 


from which we have yu’ = 0, which is contrary to our assumption that the p,, 
are linearly independent. It follows therefore that the expressions 


Bn + (0) Mm 
are linearly independent. 


Now if the linear equation 
+ JT (fio) = 0 
has at least n linearly independent solutions, then the adjoint linear integral 
equation 
+ dp” To (a0) = 0 
will also have at least n linearly independent solutions, so that by Theorem II 
of § 1, we know that the system (2’) has at least n-fold compatibility. 

By following through a similar line of reasoning, we show that if the system 
(2’) has m-fold compatibility, the system (2) has at least m-fold compatibility. 
But from this we conclude that m = n, which is the desired result. 

Since 0-fold compatibility of system (2) yields a unique solution for the 
system (1), Theorem I may be regarded as a corollary to Theorem II. 

We note further that in the course of the proof we have extended Lemma 
II to 


Lemma Ila. Lvery solution x of the equation 
K+ JST = 0, [x + JKSoi (m0) = 0] 


when jo [no] ts a solution of equation (A’) [(A)] whose Fredholm determinant is 
not zero, is expressible in the form S2() [| T2(4)], where n [4] ws a solution of 
the system (2) [(2’)]. 

TuHeoreM III. A necessary and sufficient condition for the existence of a 
solution of the non-homogeneous system (3) [(3’)] 


= a, Si(n) =o, = ao, T:(4) = 70] 
is that 


= —JT2(4)o0, = Jro82(n) | 


for every solution % [7] of the adjoint homogeneous system (2') [(2)]. 
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Suppose 7 is a solution of the system (3). Then if we let 4 be any solution 
of the adjoint system (2’) and apply Green’s Theorem G.* 


(0) + = Ja (a2) (a) 
JT; (4) — JT2(4)S8i(m), 
we get at once the condition of the theorem, i. e., the condition is necessary. 
On the other hand, the condition is sufficient. For if 4; be a particular 


solution of equation (C), and 4 be a solution of the system (2’), we get by the 
Green’s Theorem G, 


= — JT;(4)8:(m)- 


Applying the conditions of our theorem, we have for any particular solution 
of equation (C) and every solution 4 of system (2’) 


JT2(4)(Si(m) — 0) = 0. 
By Lemma IIa however, every solution of the equation 
K+ (no) = 0 
is expressible in the form 72 (4), i. e., we have for every such x 


Jx(Si(m) — oo) = 0. 


By Theorem III of $1, this is sufficient for the existence of a solution of 
system (3). 
ANN ArsBor, Miéu., 
February, 1917 


* Cf. I, loc. cit., p. 86. 


A FUNDAMENTAL SYSTEM OF FORMAL COVARIANTS MODULO 2 
OF THE BINARY CUBIC* 


BY 
OLIVER EDMUNDS GLENN 


If a binary form of order m, 
(ao, Mis ***s Amb21 , 22)”, 


whose coefficients are arbitrary variables, be transformed by the group A of 
all linear substitutions on 2;, x2, whose coefficients are least positive residues 
modulo p, a prime number, there is brought into existence an infinitude of 
rational integral functions of do, --+, Gm, #1, 2, which are invariants under 
the group. Whether this infinite system possesses the property of finiteness, 
in general, is an unsolved problem, but in this paper I show that, when the 
modulus is 2, the system of covariants of a cubic f; is finite and that the funda- 
mental set consists of twenty quantics. This system of covariants, five of 
which are pure invariants, is derived in explicit form. 

The methods of generation and proof of the completeness of the funda- 
mental set are developed from the point of view emphasized in a paper on the 
formal modular invariant theory, by the present writer, in volume 17 of these 
Transactions.ft These methods presuppose a knowledge of a funda- 
mental system of formal seminvariants of the given ground form; but this 
seminvariant system has been given previously{ for f; and the modulus 2, 
by Dickson. 


1. RESUME OF METHODS 


We recapitulate in (a), (b), (c), (d) certain processes, previously given, 
which are apropos in the developments relating to f;. In (e), (f) novel 
principles are developed. 


* Presented to the Society, April 28, 1917. 

+ The following papers by the present writer will be referred to by number: 

I. American Journal of Mathematics, vol. 37 (1915), p. 73. 

II. Bulletin of the American Mathematical Society, vol. 21 (1915), 
p. 167. 

III. These Transactions, vol. 17 (1916), p. 545. 

t Dickson, Madison Colloquium Lectures (1913), p. 53. 
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(a) Transvection. In I it was shown that transvection between a form, 
as f;, and the forms of the complete system of universal covariants of the 
group A (mod p) yields numerous formal concomitants mod p. These uni- 
versal covariants are* 


(1) L = 2x. — Q = (af — 2 + L. 


(b) Modular polars. In II the following invariantive operators (mod p) 
were introduced in connection with a binary m-ic fm: 


, 


m da ° 
m 


pe 
Odo 
(c) Concomitants of the first degree. Every form of order > 3 is shown in 
III to be reducible modulo 2 in terms of first degree invariants and first degree 
covariants of orders 1, 2, and 3. A set of concomitants mod 2 of f,,, of de- 
gree 1, is the following: 


K =a, + + Gn-1, Ky, = (a9 + K)a1 + (K + an) 


(2) 
Ke = ao + Kay + 23’. 


These three exist for all orders. If m is odd there exists a cubic covariant 
(3) = do + Ty aj te + + Om 22 (1itd:2=K). 


(d) Copied forms. If fm, gn are two binary forms and ¢@ is a system of 
modular concomitants of g,, then a system for any covariant F, of f,, con- 
structed on the model of a, is a system of concomitants of f,,. 

(e) Hexadic scales. There exist, in general, an infinite number of covariants 
mod p having one and the same seminvariant leading coefficient. Let 
Fy = Cor! + Cy ai 22 + +--+ be any covariant modulo 2, of odd order M, 
of fm, and construct concomitants of Fy on the models of K, Ki, Ke, Kms 
(cf. (2), (3)). These copied forms are concomitants of f,, viz., 


+ Cyr, D, = (Co + D)a + (D+ Cy) 22, 
(4) Dz, = Coai + Day + Cy 


= Coat + Ji xi + Joa + Cy (Ji t+Jd2=D). 


Lemma. Corresponding to any given cubic covariant mod 2 of fn, as, 
Fs = Covi + Craj ae, + + C3 22, 


there exists a definite cubic covariant T whose leading coefficient is the invariant 


* Dickson, these Transactions, vol. 12 (1911), p. 75, and Madison Colloquium 
Lectures (1913), p. 33. 
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+ C2, vizZ., 
= (C; C;) 2} (Co + C3) aj xe 
+ (Co + Co + Cz) ar + (C1 + C2) 23. 


(5) 


To prove the covariancy of I we assume that F; is a covariant, necessary 
and sufficient conditions for which are (1) homogeneity, (2) invariancy under 
the permutational substitution s = (ao Gm) (@1 Gm-1) (a1 and (3) in- 
variancy under the operation of transforming f, by T : 2; = 2 + 22, %2 = 2% 
(mod 2). Let the increments of C; (i = 0, ---, 3), whenf,, is transformed 
into f. by 7, be 6C; (¢ = 0, ---,3) respectively. Then if F; is the F3 
function constructed from f,, , 


F; = Co xi + (Co + Ci + ai x2 + (Co + Cy + 6C2) 21 
+ (Co + Co + C3 + + + 5C3) x} = F; (mod 2), 


whence follows 


(6) =0, 8, 8C3 = Co + Co (mod 2). 


Constructing I’ and applying (6) we obtain immediately I’ = T (mod 2), 
which proves the lemma. 
Application of this lemma to Fy; of (4) gives the covariant 


(7) Ty = Dz} + (Co + Cy + Ji) ai Xe + (Co + Cy + Jo) + Dz}. 


Observe that J;, Jz are interchanged by the substitution s. Hence, since 
J; + J. =D, the invariant leading coefficient of Ty, can contain no term 
ay a; a; --- which is left unaltered by s. This is true of all invariants which 
lead cubic covariants, for, if the leading coefficient Cy of F; is an invariant, 
we have Co = C3, and 6C; = 0. Then (6) gives 


Cy = Ci + C2 (mod 2), 


and, as the covariancy of F; requires that C; and C2 be interchanged by s, 
any term of C; which is left unaltered by s occurs also in C2 and therefore has 
a zero coefficient, modulo 2, in the sum Cy. 

We shall designate the six interrelated concomitants Fy, Fy;, ly, D, Di, De 
as the hexadic scale* for the covariant Fy of odd order M. Every covariant 
of odd order furnishes such a scale of concomitants. 

(f) Tetradic scales. If Fy is a covariant mod 2 of even order M no cubic 
covariant corresponding to Fy; exists, but we have the interrelated forms 
Fy, D, D,, De, which, accordingly, will be called a tetradic scale for Fy. 


* The term scale was used by Sylvester in the sense of a fundamental system, but this 
designation has become practically obsolete. 
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Note that the reducibility of Fy does not imply the reducibility of the 
forms in the scale for F,,, but the covariant I in any hexadic scale is reducible 
in terms of other concomitants in the scale, as follows: 


(8) Ty = QD; + Fs (mod 2). 


2. THE GENERAL SEMINVARIANT OF f3 
If 


the fundamental system of formal seminvariants modulo 2, given by Dickson, 
is composed of 
ao, = (ao + K + a3) az, 
®) A = a9 a3 + a B = aoa, + 
With these may be associated the following invariants:* 
K, IT =a, + doo, k = ao bo0, 
K*) + (A + (8 + a0. K + 
We have noticed previously, in III, the following syzygies connecting 
seminvariants. They result immediately from (9) and (10). , 


(10) 


a+@K+aI+k=0, 
(11) K?) (mod 2). 
+ (a +aK+I+A)(a,K + K*?) +9 =0 


It will be convenient, in this paper, t to abbreviate the invariant g + JA + A? 
as gi. Thus, 


(11,) g = 91 + IA + A*(mod 2). 
Any seminvariant ¢ of f;, being a polynomial in the seminvariants (9), is a 
polynomial in ay), K,J, 8, A, 
(ao, B, K, A, I). 


Hence when the congruences (11) are used as reducing moduli with respect 
to powers of a) and of 8, @ can be reduced to the form 


(12) + (To +Ti ao +T2 45) 8, 


where J;, T'; (¢ = 0,1, 2) are invariants expressed as polynomials in K, A, 


* Cf. Dickson, loc. cit.; and III, pp. 554, 555. 

+ The advantage of this change is that g contains terms which are symmetrical under s (§ 1) 
and hence g cannot be the leading coefficient of any covariant of odd order, whereas we shall 
determine a cubic covariant led by g:. 
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k, 1,91. Formula (12) is therefore the general form of a seminvariant leader 
of a formal covariant of f;. Another form of ¢ which we shall employ is 
obtained by introducing B into (12) through the defining relation B = 8 + A. 
Thus, 

(13) = Ro + Ri ao + Rz ap + (So + Si ay + S2 a5) B, 

where R;, S; (0 = 0, --+, 2), like Ji, T; (¢ = 0, ---, 2), are polynomials 
in K,A,k, I, g:, with numerical coefficients. 


3. CONSTRUCTION OF COVARIANTS OF f3 


The hexadic scale for the quantic f; itself, composes a system of first degree 
concomitants. The forms in this scale are f;, f3,G, K, Ki, Ke, where (cf. III, 
p. 555) 


G = Kaji + (do + a1 + a3) + (ao + a2 + a3) a1 + 
= QK, + fs (mod 2). 


The hessian of f; is both an algebraical and a formal modular covariant, viz., 


(14) 


H = sxj + Ax, x2 + (a; a3 + a3) x3 (s = +a?). 
The tetradic scale for QH consists of QH, H, G,, and A, where 
G, = (s + A)a, + (A + ay a3 + a3) [= (H, L)*}. 
A quadratic covariant led by 8 is 
(15) t= KK,+ H = Bai + (A + K?) a, 22 + (a2 a3 + a3) 23, 
and in the tetradic scale for Qt occurs 
as) =(B+A+4+ K*)a, + (A+ K? + a az + a3) 
= + KK,(mod 2). 
Apply to the forms just derived the modular operator 


— 2 2 2 
Bao + + a: + as Oa; 


Thus we get 
C, = wk, = (a5 + K?) a, + (K? + aj) a, 
(17) C. = wKe = + K? + a3 = Ki + K* Q(mod 2), 


P = wfs = a) + aj Xe + a2 + 


In order to perform the reductions necessary to isolate the complete system 
sought we shall need, among other forms, quadratic covariants led by the 


114 0. E. GLENN [January 


seminvariants do 8 and aj B respectively, and cubic covariants led by B, ao B, 
and a; B. These are derived by forming certain scales of concomitants, as 
follows: 

Construct the hexadic scale for the quintic covariant f;¢, and we have, 
neglecting reducible forms, 


Fs3 = do Bai + (a5 az + a, a3 + ao ay a3 + ao G2 a3) Xe 
+ (ao a5 + aj dz + ao dz a3 + Ag) + (a2 + a3 a3) a2, 

Dz = do Bai + (k + KA) 2x1 #2 + (a2 a3 + a3 a3) 22, 

D, = +k + KA)a, + (k + KA + + a3) 2x2. 

Next we form the hexadic scale for the quintic tP, giving 

= a2 + (a3 a3 + a5 ay + a5 a2 a3 + a5 at + ag aj + ag aj a3 
+ ao a; a; + aj ag) x7 v2 + (a) a3 + a9 a3 a3 + aj a2 a3 + a3 a3 
+ ay a; a3 + ay a2 a3 + a3 + ao a3) + (a2 a3 + a3 23, 

= ai Bui + (IA + A? + g) aire + (a2 a3 + a3 a3) 23, 

Di = (a B+ 1IA+ 4+ + + a a3 + az) a. 


Again, the scale for the quintic Q/, where / is the cubic 


(20) 1=QG, + Kfs = Bui + (ao a2 + ai a2 + a; a3 + a2) 
+ (do + + + aj) + (A + a2 a3 + 23, 


furnishes the covariants 


ly = + K? 21,22 + (A + ae a3 + a2) 23, 
ty (B+ K*)a, + (K? + A+ a. a3 + a3) 22, 


and it is now evident that cubic covariants, say F and F’ , led by the respective 
seminvariants a) B, a; B may also be constructed. For F is the cubic co- 
variant (the one not led by an invariant) in the hexadic scale for the quintic 
fsl. and F’ is the corresponding cubic in the scale for Pl,. We also write 
F and F’ explicitly, but they will be proved to be reducible (cf. (26)): 


(21) 


3 2 2 2 2 
= do + (a) dz + a3) Xe + (Ap a3 + Ay Ae Az) 
+ (ao a3 + a; a2 a3 + a2 a3 + a3 a3) 22, 
2 3 3 SS 2 2 2 2 
ay Buy + (a9 a3 + a) ay + + Gy + Ap + G2 a3 
2 3 3 3 2 2 2 
+ Ay Az + + Az + a2) + (0 aj + Ao 
2 3 2 3 2 2 2 


3 3 2 3 2 3 2 2 3 
+ dz + G2) Xz + (a0 43 + a3 + G2 a3 + a3) 22. 


F’ 
(22) 
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4. COVARIANTS OF fs; WHOSE LEADING COEFFICIENTS ARE INVARIANTS 


Reduction methods to be employed in the next section require an explicit 
knowledge of all covariants which have as leading coefficients pure invariants 
which are polynomials in the invariants K, A, k, I, g:, homogeneous as to 
ao, ***, 43. Wecan write the general polynomial in question in the form 


@=f(1,4A)+ +nys, 


where f(J, A) is a polynomial in J and A only, and y¥; (2 = 1, 2,3) are 
polynomials involving, in general, all five invariants. 

Lemma. No covariant of odd order exists having f (I, 4) as a leading coef- 
ficient. 

In proof of this we show that every polynomial in J and A alone, which is 
homogeneous in do, «++, a3, necessarily has a term which is left unaltered by 
the substitution s = (a9 a3) (a,a2). In fact the only symmetrical term in 
I? is a? a ,* while all terms in A’ are symmetrical under s. If J? A’ is the 
term containing the highest power of A in f(J, A), then the term 


T= az as 


certainly occurs in f with a numerical coefficient which is = 0 (mod 2). But, 
-as shown in § 1 (e), no covariant of odd order can be led by an invariant con- 
taining a term 7 unaltered by s. This proves the lemma. 

Lemma. There exists both a quadratic and a cubic covariant (but no linear 
covariant) led by each one of the three invariants K, k, gy. 

The quadratic covariants are the products of the respective invariants 
K,k,g by Q. 

The cubic led by K is the covariant G in (14). 

A cubic covariant led by k& is found by constructing the following polynomial 
in concomitants derived in § 3: 


T = QD, + Fs3 + AG; 
(23) T = kat + (a5 a, + ao aj + a2 a5 + a; a3 + a1 a} + ao a2 a3 
+ do dz + + + a3) + (a2 a3 + ad as 
+ aj ay + ao + aj + ao ay a3 + + + 3 
+ a) a3) x, x; + ke}. 
Finally, a cubic covariant whose leader is the invariant g; = JA + A? + g is 


the covariant led by an invariant, belonging to the hexadic scale for tP. 


= The expression z4 is the only term in the expansion of (z: + z2 + --- + 2;)" which has 
an odd coefficient and is left unaltered by r = (2: 27) (22 26) (23 25) (24). 
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By (8) this covariant, say FE, is reducible as follows (cf. (19)): 
(24) E = QD, + Fs (mod 2). 


Combining the preceding results in this section we conclude as in the fol- 
lowing 

Lemma. The most general form for a pure invariant leading coefficient of a 
covariant of f3, of odd order, is 


and the following quantic is a cubie covariant led by S: 


25) 


5. THE FUNDAMENTAL SYSTEM OF COVARIANTS OF f3 


Every covariant mod 2 of even order, of f;, is of the form R = ¢aj" + ---, 
where the leading coefficient is the @ function of (12). We shall construct, 
as a rational integral function of the concomitants which were derived in the 
two preceding sections, another covariant C whose seminvariant leading 
coefficient is @. Then R — C, although not vanishing in general, will always 
be congruent to the product of a covariant C’ of odd order 2h — 3 by L. 
This consideration, the method of which is due to Dickson, evidently furnishes 
a general reduction method,* since the same process can be applied to C’ and 
to the covariants analogous to C’, in succession. 

For example, if we subtract F;3; + Af; from F (ef. (18) and (22)), and 
reduce the factor C’ in the remainder, we get the first relation below. The 
second relation is given by performing similar operations in connection with 
the covariant F’ of (22), 

F = Fy3 + Afs + LKA, . 
(26) 
F’ = F;, + AP + LK?’ A (mod 2). 
In the general case, when R = gai" + --- (h 21), we find 
R= Jo + J; + Je C,Q* + +T,D.Q*" 
+T, D2 Q*" + LC’ (mod 2), 
where C’ = O if h = 1, and C’ is a covariant of odd order 2h — 3ifh > 1. 

Next, when R is of odd order 2h + 1 > 1, namely R = gai"*' + ---, we 
deduce by use of the form (13) of ¢, involving B, and the last lemma in 
Section 4, 

(28) R + Ri fs + Re P + Sol + Si F + S, F’)Q** + LC’ (mod 2), 


* Dickson, Proof of the finiteness of modular covariants, these Transactions, vol. 14 
(1913), p. 299. 
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where, if h = 1, C’ is an invariant, and, if h > 1, C’ is a covariant of even 
order 2h — 2. 

After we have applied these processes successively to the covariants C’, we 
shall have reduced all covariants of order + 1, excepting the irreducible con- 
comitants of orders 0, 1, 2, 3 in terms of which the covariants in formulas 
(27), (28) are explicitly constructed, as in formulas (14) to (26). 

Consider next the covariants which are linear in 2, 22, all being of the 
form 

A = + gi te. 
From (12) we have, identically, 
¢=N+d1(a9+ K) + J2(a) + K?) K*) 
+Ti(a98 +k + KA) +T2 (ai + gi) (mod 2), 


in which N is an invariant (cf. (11,)); 


Net A +7, KA + (mod 2). 


(29) 


We can construct a linear covariant led by each parenthesis in (29) (ef. (30)). 
No linear covariant is led by an invariant. Hence, assuming \ to be a co- 
variant, V = 0 (mod 2). Making use of the linear covariants in the various 
scales of concomitants in Section 3, we have, . 


(30) +7: Di + T2 Di (mod 2). 


Hence all linear covariants are reducible in terms of the invariants K, A, k, 
I, g; and the five covariants K,, Ci, 4, Di, D}. 

Giving attention, now, to the covariants entering the formulas (27), (28), 
(30), expressed as rational integral functions of other covariants by the explicit 
formulas given in Sections 3 and 4, we summarize our conclusions, in the 
following 

THEOREM. A fundamental system of formal covariants modulo 2 of the binary 
cubic quantic f; is composed of twenty forms, as follows: Five invariants K, A, 
k, I, 91; five linear covariants K,, Ci, ti, Di, Di; four quadratic covariants 
K., t, D2, D2; and four cubic covariants f;, P, Fs3, Fs3, together with the two 
universal covariants L, Q. 


6. SYZYGIES 


The syzygies connecting the members of this fundamental system are 
legionary. In the paper quoted as III above I gave a theorem which estab- 
lishes the existence and furnishes a method of construction of an infinitude 
of syzygies, although this theorem does not directly furnish all such relations. 
Each identity (11) connecting the fundamental seminvariants of f; furnishes a 


118 O. E. GLENN 


syzygy, for if we substitute in one of these, for each seminvariant, a covariant 
which the latter leads, paying attention to considerations of homogeneity, we 
get a reducible covariant which is congruent to L times a covariant C’, and 
reduction of C’ leads to a syzygy. The 2; below was constructed from the 
first relation in (11); 22 by the before-mentioned theorem: 


= Ki Kk, + K°K.Q+ KKiQ+ + IK2Q 
+ kQ? + K? K,L + 1K, L = 0 (mod 2), 
tfs + Kot Q + KK, K2Q + KKofs + AfsQ + Do L 
+ KK} L + K* LQ + AKLQ = 0 (mod 2). 
Since a syzygy is a polynomial in the fundamental concomitants we can prove 
that all are expressible in terms of a finite set of irreducible ones, 21, -- - 


by applying Hilbert’s theorem, replacing the equations customarily under- 
stood in this theorem by identical congruences modulo 2. That is, any 
syzygy = may be put in the form 


+ 0222+ --- +0, 2, (mod 2), 


where o;, --+, ¢,, being polynomials in the concomitants, are themselves 


concomitants of f;. 
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